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Abstract 


Selection  and  ranking  (more  broadly  multiple  decision)  problems  arise  in 
many  practical  situations  where  the  so-called  tests  of  homogeneity  do  not  provide 
the  answers  the  experimenter  wants. 

The  logistic  distribution  has  been  applied  in  studies  of  population  growth, 
of  mental  ability,  of  bio-assay,  of  life  test  data  and  of  biochemical  data,  but  the 
complete  distribution  of  the  sample  means  and  variances  of  a  logistic  population 


has  not  been  obtained  yet. 

-;A-  i  ti 

'  hr  this  paper -we  study  the  selection  and  ranking  problems  for  logistic  popu¬ 


lations  and  an  elimination  type  two-stage  procedure  for  selecting  the  best  popu- 

Z'  i  ->  4  C 

lation  using  a  restricted  subset  selection  rule  in  its  first  stage.  <  v  '  r 


Chapter  2  deals  with  the  selection  and  ranking  procedures  for  logistic  pop¬ 
ulations.  An  excellent  approximation  to  the  distribution  of  the  sample  means 

i;  , 

from  a  logistic  population  is  derived  by  using  the  Edgeworth  series  expansions.  rr>  -  - 

^  C  j 

Using  this  approximation,  we  propose  and  study  a  single-stage  procedure  using  C  ’r  ■ 
the  indifference  zone  approach,  two  subset  selection  rules  based  on  sample  means 
and  medians  respectively  for  selecting  the  population  with  the  largest  mean  from 
k  logistic  populations  when  the  common  variance  is  known. 

Chapter  3  considers  an  elimination  type  two-stage  procedure  for  selecting  the 
population  with  the  largest  mean  from  k  logistic  populations  when  the  common 
variance  is  known.  A  table  of  the  constants  needed  to  implement  this  procedure  is 
provided  and  the  efficiency  of  this  procedure  relative  to  the  single-stage  procedure 
is  investigated. 
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Chapter  4  deals  with  a  single-stage  restricted  subset  selection  procedure  for 
selecting  the  population  with  the  largest  mean  from  k  logistic  populations  when 
the  common  variance  is  known.  Some  properties  of  this  procedure  such  as  mono- 
tonidty  and  consistency  are  investigated.  Tables  of  required  sample  sizes  for  this 
procedure  are  provided.  A  new  design  criterion  to  get  the  needed  sample  size 
and  the  constant  defining  this  procedure  simultaneously  is  proposed  and  a  table 
of  these  constants  is  given. 

Chapter  5  deals  with  a  more  flexible  two-stage  procedure  for  selecting  the 
best  population,  which  uses  a  restricted  subset  selection  rule  in  its  first  stage 
and  the  Bechhofer’s  (1954)  natural  decision  procedure  in  the  second  stage,  in 
terms  of  a  set  of  consistent  estimators  of  the  real  population  parameters,  whose 
distributions  form  a  stochastically  increasing  family  for  a  given  sample  size. 

KEY  WORDS:  Selection  and  Ranking,  Restricted  Subset  Selection  Procedure,  Two- 
Stage  Procedure,  Largest  Mean,  Subset  Selection,  Logistic  Populations. 
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1  INTRODUCTION 


It  is  not  uncommon  that  we  face  a  problem  of  making  decisions  regarding  k  given 
populations,  for  example,  different  varieties  of  wheat  in  an  agricultural  experiment,  or 
different  competing  designs  of  engines  to  be  used  in  an  automobile  plant,  or  different 
drugs  for  a  certain  ailment.  Suppose  9\, . . . ,  Ok  are  the  characteristics  or  parameters  of 
the  populations  in  which  the  experimenter  is  interested.  The  classical  approach  in  the 
preceding  problems  has  been  to  test  the  so-called  homogeneity  hypothesis  Ha  :  0X  = 
. . .  =  Ok-  However  the  experimenter’s  real  goal  often  is  to  identify  the  best  population 
(the  variety  with  the  largest  average  yield,  the  most  reliable  system  and  so  on).  Then 
the  test  of  H0  is  unrealistic  for  this  problem.  Attempts  were  made  to  overcome  the 
shortcomings  of  the  classical  tests  of  homogeneity  by  formulating  the  problem  in  a 
more  meaningful  and  realistic  way.  A  partial  answer  was  provided  by  Mosteller  (1948) 
who  tested  homogeneity  against  slippage  alternatives.  Paulson  (1949),  Bahadur  (1950) 
and  Bahadur  and  Robbins  (1950)  are  among  the  early  investigators  to  recognize  the 
shortcomings  of  the  classical  test  of  homogeneity  hypothesis  and  to  formulate  the  k- 
population  problem  as  a  multiple  decision  problem  in  the  framework  of  what  have  now 
come  to  be  known  as  selection  and  ranking  procedures. 

The  two  main  approaches  that  have  been  used  in  formulating  a  selection  and  ranking 
problem  are  familiarly  known  as  the  indifference  zone  approach  and  the  subset  selection 
approach.  The  basic  problem  in  the  indifference  zone  approach,  due  to  Bechhofer 
(1954),  is  to  select  one  of  the  k  populations  with  a  guarantee  that  the  probability  of 
selecting  the  best  population  is  at  least  a  fixed  probability  Pm(l/k  <  P*  <  1)  whenever 
the  unknown  parameters  lie  outside  some  subspace  of  the  parameter  space,  the  so- 
called  indifference  zone  (the  complement  of  an  indifference  zone  is  called  a  preference 
zone).  Here  some  knowledge  of  the  parameter  space  is  assumed  known  a  priori,  for 
example,  the  experimenter  must  be  able  to  guarantee  that  the  largest  parameter  is 
separated  from  all  other  ranking  parameters  by  a  distance  not  less  than  6,  say.  Other 
contributions  to  this  approach  are  Bechhofer  and  Sobel  (1954),  Bechhofer,  Dunnett 
and  Sobel  (1954),  Sobel  and  Huyett  (1957),  Sobel  (1967),  Bechhofer,  Kiefer  and  Sobel 


(1968),  Mahamunulu  (1967),  Desu  and  Sobel  (1968,1971)  and  Tamhane  and  Bechhofer 
(1977,1979)  among  others.  There  are  several  variations  and  generalizations  of  the  basic 
goal  discussed  above.  For  details,  reference  can  be  made  to  Gupta  and  Panchapakesan 
(1979)  and  Dudewicz  and  Koo  (1982). 

In  the  subset  selection  approach  known  as  “Gupta’s  formulation”  for  selecting  the 
best  population,  the  goal  is  to  select  a  nonempty  subset  of  the  k  populations  so  that 
the  best  population  is  included  in  the  selected  subset  with  a  minimum  guaranteed 
probability  JP*(l/Jb  <  P*  <  1)  over  the  whole  parameter  space.  Here  the  size  of  the 
selected  subset  is  not  determined  in  advance  but  depends  on  the  data  and  hence  it 
is  a  random  variable.  Among  decision  procedures  which  satisfy  the  basic  probabil¬ 
ity  requirement,  one  which  yields  the  smallest  expected  size  of  the  selected  subset  is 
considered  in  some  ways  to  be  the  most  desirable.  Another  performance  criterion  for 
comparing  decision  procedures  is  the  expected  number  of  the  non-best  populations  in 
the  selected  subset.  Some  recent  contributions  in  the  subset  selection  formulation  have 
been  made  by  Gupta  and  Studden  (1970),  Gupta  and  Nagel  (1971),  Gupta  and  Pan¬ 
chapakesan  (1972),  Gupta  and  Santner  (1973),  Santner  (1973,1975),  Gupta  and  Huang 
(1975a,1975b),  Gupta  and  Huang  (1976),  Bickel  and  Yahav  (1977),  Gupta  and  Singh 
(1980),  Gupta  and  Hsiao  (1983),  Lorenzen  and  McDonald  (1981)  among  others. 

In  the  basic  subset  selection  formulation  we  select  a  nonempty  subset  of  the  k  given 
populations.  When  the  parameters  are  all  very  close  to  one  another,  we  are  likely  to 
select  all  the  populations.  So  it  is  meaningful  to  put  a  restriction  that  the  size  of  the 
selected  subset  will  not  exceed  m  (1  <  m  <  k).  Even  otherwise,  one  may  want  to  select 
a  nonempty  subset  of  a  random  size  to  a  maximum  of  m.  Such  a  formulation  is  called 
a  restricted  subset  selection  formulation.  The  general  theory  was  developed  by  Santner 
(1973,1975)  and  the  normal  means  selection  problem  was  investigated  by  Gupta  and 
Santner  (1973).  An  important  feature  of  this  formulation  is  that  an  indifference  zone 
(preference  zone)  is  introduced. 

Besides  being  a  goal  in  itself,  selecting  a  subset  containing  the  best  can  also  serve 
as  a  first  stage  screening  in  a  two-stage  procedure  designed  to  choose  one  population  as 
the  best.  Some  important  contributions  in  this  direction  have  made  by  A  lam  (1970), 


Tamhane  and  Bechhofer  (1977,1979),  Miescke  (1982),  Gupta  and  Miescke  (1982,1984), 
Gupta  and  Kim  (1984)  and  Lee  and  Choi  (1985). 

There  are  several  other  variations  and  generalizations  of  the  basic  subset  selection 
formulation,  for  example,  the  decision-theoretic  approach  where  some  Bayes  and  empir¬ 
ical  Bayes  rules  and  several  minimax  and  f- mini  max  rules  have  been  studied  by  various 
authors,  selection  procedures  for  multivariate  normal  and  multinomial  distributions, 
nonparametric  procedures,  selection  from  restricted  families,  sequential  procedures, 
isotonic  procedures  etc.  For  further  developments  in  subset  selection  formulation,  ref¬ 
erence  can  be  made  to  Gupta  and  Panchapakesan  (1979),  Gupta  and  Huang  (1981), 
Dudewicz  and  Koo  (1982),  and  Gupta  and  Panchapakesan  (1985). 

The  main  contributions  of  this  paper  are  first,  to  propose  and  study  new  selection 
and  ranking  procedures  for  the  logistic  populations  and  second,  to  propose  an  elim¬ 
ination  type  two-stage  procedure  for  selecting  the  best  population  using  a  restricted 
subset  selection  rule  in  its  first  stage  and  to  apply  this  procedure  to  specific  problems. 

Chapter  2  deals  with  the  basic  selection  and  ranking  procedures  for  logistic  pop¬ 
ulations.  The  range  of  application  of  the  logistic  distribution  as  a  probability  model 
to  describe  random  phenomenon  covers  such  areas  as  population  growth,  bioassay,  life 
test  and  physiochemical  phenomena.  The  exact  distribution  of  the  mean  of  samples 
from  a  logistic  populations  has  not  been  obtained  completely  yet  though  it  is  needed  in 
the  various  studies  about  logistic  distributions  such  as  estimating,  testing  hypothesis 
and  selection  and  ranking  problems.  An  excellent  approximation  to  the  distribution  of 
the  sample  means  from  a  logistic  population  is  derived  by  using  the  Edgeworth  series 
expansion  and  it  is  compared  to  other  approximations.  Using  this  approximation  we 
consider  a  single-stage  indifference  zone  approach  procedure  V\  for  selecting  the  best 
logistic  population.  We  also  propose  two  subset  selection  rules  R\  and  Ri  based  on 
sample  means  and  medians  respectively  and  compare  them  to  each  other  by  means  of 
their  performance  characteristics. 

Chapter  3  considers  an  elimination  type  two-stage  procedure  for  selecting  the  pop¬ 
ulation  with  the  largest  mean  from  k  logistic  populations.  We  propose  a  two-stage 
procedure  Vi  which  is  based  on  an  optimization  problem  by  using  a  minimax  criterion. 


Lower  bounds  for  the  infimnm  of  the  probability  of  a  correct  selection  over  the  pref¬ 
erence  zone  and  the  supremum  over  the  whole  parameter  space  of  the  expected  total 
sample  size  needed  for  V%  are  derived.  A  table  of  the  constants  needed  to  implement 
V2  is  provided  by  solving  the  optimization  problem  and  the  efficiency  of  V2  relative  to 
the  single-stage  procedure  V\  is  investigated. 

Chapter  4  deals  with  a  single-stage  restricted  subset  selection  procedure  for  logis¬ 
tic  populations.  We  consider  a  restricted  subset  selection  procedure  R3  based  on  the 
sample  means  for  selecting  the  population  with  the  largest  mean  from  k  logistic  popu¬ 
lations  when  the  common  variance  is  known.  Formulas  for  the  probability  of  a  correct 
selection  for  any  given  set  of  parameters  and  for  the  infimum  over  the  preference  zone 
of  the  above  probability  are  derived  and  some  properties  of  this  procedure  such  as 
monotonicity  and  consistency  are  studied.  Tables  of  the  bounds  of  the  infimnm  of  the 
probability  of  a  correct  selection  over  the  preference  zone,  tables  of  the  required  sam¬ 
ple  sizes  for  the  rules  and  tables  of  the  expected  number  of  selected  populations  are 
provided.  A  new  design  criterion  to  get  the  needed  sample  size  (n)  and  the  constant 
defining  the  rule  (h)  simultaneously  is  proposed  and  a  table  of  the  constants  (n,  h)  is 
provided. 

Chapter  5  deals  with  a  more  flexible  two-stage  procedure  for  selecting  the  best 
population.  We  propose  an  elimination  type  two-stage  procedure  V2  in  which  a  gen¬ 
eralized  restricted  subset  selection  rule  is  used  in  the  first  stage  and  the  Bechhofer’s 
(1954)  natural  decision  procedure  in  the  second  stage.  This  rule  is  based  on  a  set  of 
consistent  estimators  for  the  parameters,  whose  distributions  are  assumed  to  form  a 
stochastically  increasing  family  for  a  given  sample  size.  We  also  propose  a  non-linear 
optimization  problem  using  a  minimax  design  criterion  to  find  a  set  of  design  constants 
for  A  lower  bound  of  the  probability  of  a  correct  selection  is  derived  and  also  a 
formula  for  the  infimnm  of  the  lower  bound  over  the  preference  zone  is  derived.  An 
analytic  expression  for  the  expected  total  sample  size,  the  conditions  guaranteeing  that 
the  supremum  over  the  whole  parameter  space  of  the  expected  total  sample  size  oc¬ 
curs  at  some  point  where  the  parameters  are  all  equal,  and  a  general  expression  of  the 
supremum  of  the  expected  total  sample  size  under  these  conditions  are  derived.  Finally 


we  apply  to  the  location  parameter  problem  of  univariate  normal  populations  by 
providing  the  tables  of  the  design  constants  to  implement  and  of  the  values  of  the 
relative  efficiency  with  respect  to  the  single-stage  procedure. 


2  SELECTION  AND  RANKING  OF  THE  LO¬ 


GISTIC  POPULATIONS 

2.1  Introduction 

The  logistic  distribution  has  been  widely  used  by  Berkson  (1944,1951,1957)  as  a  model 
for  analyzing  experiments  involving  quantal  response.  Pearl  and  Reed  (1920)  used  this 
in  studies  connected  with  population  growth.  Placket t  (1958,1959)  has  considered  the 
use  of  this  distribution  with  life  test  data.  Gupta  (1962)  has  studied  this  distribution 
as  a  model  in  life  testing  problems. 

The  shape  of  the  logistic  distribution  is  similar  to  the  normal  distribution.  The 
simple  explicit  relationship  between  the  logistic  random  variate,  its  probability  den¬ 
sity  function  (pdf)  and  its  cumulative  distribution  function  (cdf)  render  much  of  the 
analysis  of  the  logistic  distribution  attractively  simple  and  many  authors,  for  example, 
Berkson  (1951)  prefer  it  to  the  normal  distribution. 

The  importance  of  the  logistic  distribution  in  the  modeling  of  stochastic  phenomena 
has  resulted  in  numerous  other  studies  involving  probabilistic  and  statistical  aspects  of 
the  distribution.  For  example,  G umbel  (1944),  G umbel  and  Keeney  (1950)  and  T&lacko 
(1956)  show  that  it  arises  as  a  limiting  distribution  in  various  situations;  Birabaum  and 
Dudman  (1963),  Gupta  and  Shah  (1965)  study  its  order  statistics.  Many  other  authors, 
for  example,  Antle,  Klimko  and  Harkness  (1970),  Gupta  and  Gnanadesikan  (1966)  and 
Tarter  and  Clark  (1965),  investigate  inference  questions  about  its  parameters. 

As  might  be  expected,  because  of  the  similarity  between  the  logistic  and  the  normal 
distributions,  the  sample  mean  and  variance,  the  moment  estimators  of  the  logistic 
population  parameters,  are  effective  tools  for  statistical  decisions  involving  the  logistic 
distribution.  Antle,  Klimko  and  Harkness  (1970)  give  a  function  of  the  sample  mean  as 
a  confidence  interval  estimate  of  the  population  mean  when  the  population  variance  is 
known.  Schafer  and  Sheffield  (1973)  show  that  in  terms  of  the  mean  squared  error  the 
moment  estimators  of  the  logistic  population  parameters  are  as  good  as  their  maximum 
likelihood  estimators.  The  fact  that  the  distribution  of  a  sample  mean  has  monotone 


likelihood  ratio  (MLR)  with  respect  to  the  population  mean  when  the  variance  is  known 
is  used  by  Goel  (1975)  to  obtain  a  uniformly  most  accurate  confidence  interval  for  the 
population  mean  and  a  uniformly  most  powerful  test  for  one-sided  hypotheses  involving 
the  population  mean.  The  sampling  distribution  of  the  mean  is  a  primary  requirement 
for  these  statistical  purposes.  The  papers  by  Antle,  Klimko  and  Harkness  (1970)  and 
Tarter  and  Clark  (1965)  used  a  Monte  Carlo  method  for  this  distribution. 

Goel  (1975)  obtains  an  expression  for  the  distribution  function  of  the  sum  of  inde¬ 
pendent  and  identically  distributed  (it'd)  logistic  variates  by  using  the  Laplace  trans¬ 
form  inverse  method  for  convolutions  of  Polya  type  functions,  a  technique  developed  by 
Schoenberg  (1953)  and  Hirschman  and  Widder  (1955).  He  provides  a  table  of  the  cdf 
of  the  sum  of  iid  logistic  variates  for  the  sample  size  n  =  2(1)12,  x  =  0(0.01)3.99  and 
n  =  13(1)15,  x  —  1.20(0.01)3.99.  He  also  gives  a  table  of  the  quantiles  for  n  =  2(1)15, 
a  =  0.90,0.95,0.975,0.99,0.995.  George  and  Mudholkar  (1983)  obtain  an  expression 
for  the  distribution  of  a  convolution  of  the  itd  logistic  variables  by  directly  inverting 
the  characteristic  function.  However,  since  both  formulas  of  Goel  (1975)  and  George 
and  Mudholkar  (1983)  contain  a  term  (1  —  e*)-fc,  k  —  1, . . .  ,n,  a  problem  of  precision  of 
the  computation  at  the  values  of  z  near  zero  arises  when  n  is  large.  George  and  Mud¬ 
holkar  (1983)  also  show  that  a  standardized  Student’s  t  distribution  provides  a  very 
good  approximation  for  the  distribution  of  a  convolution  of  the  iid  logistic  random 
variables. 

This  chapter  considers  approximation  problems  to  the  distribution  and  quantiles 
of  a  standardized  mean  of  samples  from  a  logistic  population  by  using  Edgeworth  and 
Cornish-Fisher  series  expansions  respectively.  Tables  of  the  cdf  and  quantiles  are  pro¬ 
vided  and  it  is  shown  that  these  axe  far  better  approximations  than  the  Student’s  t 
distributions  as  suggested  in  Goerge  and  Mudholkar  (1983)  and  hence  these  approxi¬ 
mations  will  be  used  henceforth. 

In  the  rest  of  this  chapter  a  single-stage  procedure  V\  using  an  indifference  zone 
formulation  for  selecting  the  best  among  several  logistic  populations  with  unknown 
means  and  a  common  known  variance  based  on  sample  means  is  proposed  and  studied. 
A  table  of  the  smallest  sample  size  n  needed  to  implement  V\  subject  to  the  basic 


probability  requirement  is  provided. 

Two  subset  selection  rules  R\  and  /2a  based  on  sample  means  and  sample  m^iana 
respectively  for  selecting  the  best  among  several  logistic  populations  are  proposed  and 
tables  of  constants  to  implement  the  rules  are  provided.  We  also  compare  the  two  rules 
with  respect  to  their  performance  characteristics. 

2.2  Distribution  of  logistic  sample  means 

2.2.1  Logistic  distribution 

A  random  variable  X  has  the  logistic  distribution  with  mean  p  and  variance  <r3,  some* 
times  denoted  by  Lfao2),  if  the  pdf  of  X  is  given  by 

/(*)  =  ($/*)[®cp{-0(*  -  p)M][  1  +  exp{-0(*  -  p)/<r}]~3  (1) 

and  the  cdf  of  X  is  defined  by 

F(z)  =  [1  +  exp{-<?(x  -  p)/<r}]~\  (2) 

where  — oo  <  x  <  oo,  —  oo  <  p  <  oo,  c  >  0  and  g  ss  x /y/Z.  This  distribution  is 
symmetrical  about  the  mean  p. 

The  standard  logistic  distribution  with  mean  zero  and  variance  unity,  denoted  by 
1(0, 1),  has  the  pdf  and  cdf  defined  as 

/(*)  =  $[exp{-$z}][l  +  exp{— fifz}]'3  (3) 

and 

F(*)  *  [1  +  exp{— y®}]"1  (4) 

respectively,  where  — oo  <  *  <  oo.  The  standard  logistic  distribution  has  a  shape 
similar  to  the  standard  normal  distribution.  The  curve  of  the  logistic  distribution 
crosses  the  density  curve  of  the  normal  between  0.68  and  0.69.  The  inflection  points 
of  the  pdf  of  the  standard  logistic  distribution  are  ±0.53  (approx.). 

Letting  Y  *  (X  —  p)g/t t,  the  random  variable  Y  has  the  logistic  distribution  with 
mean  zero  and  variance  x3/3.  The  pdf  and  cdf  of  the  random  variable  Y  are  given  by 


and 


f(sO  =  [i+«p{-j,}]-' 


(6) 


L$22i 


respectively,  where  — oo  <  y  <  oc.  (5)  may  be  written  in  terms  of  F(y)  as 


m  =  ny)(l  -  F(y)). 


The  moment  generating  function  ( mgf )  of  Y  is  given  by 


My(t)  =  r(i+t)r(i-<) 

=  x</sin?rt,  |t|  <  1. 


We  can  also  express  (8)  as 


My{t)  =  E(-l)i'1[2(23i-1  -  l)/(2 j)!]£2j(*t)2', 

jmO 


where  BJa  are  Bernoulli  numbers  defined  as 


x/(exp(x)  -  1)  =  2  Bvxvl{vVj. 


The  relationships  between  J3„’s  are  given  by 


Bo  =  1 


1  +  •••  +^_  =  0,  i  =  1,..., 


and  hence  the  first  few  values  of  Bu  are 


Bo  =  1, 


Bx  =  -1/2, 

Bi  =  1/6, 

Ba  =  -1/30, 
Bo  =  1/42, 

=  -1/30, 
B\o  —  5/66, 


^Jm+l  —  0,  TO  —  1,  2, .  . .  • 


mmmmm 


The  Vth  central  moments  of  Y ,  denoted  by  fJtv(y),  ran  be  obtained  as 


My)  =  e(yv) 


(— l)v/3”1[2(2*'”1 

°; 


-  1  )\Bvx¥\  if i/  =  2j, j  =  1,2,..., 
otherwise, 


by  using  (9). 

Then  the  Vth  central  moments  of  X,  denoted  by  Mz)>  given  by 


M*)  =  E(x-n)v 

=  (crlgYE{Yv) 


(— l)"/3-1 
1°; 


(v^)“(2(2“-‘  -  1)]B„;  ifx  =  2j,j  =  1,2 . 

otherwise. 


In  terms  of  the  central  moments  iiv{x)  of  X,  first  few  of  the  Vth  cumulants  of  X,  denoted 
by  Ky(x),  v  =  1, 2, . . which  are  defined  by 

logMW-E/fvWfiiJVf*'!), 

v»l 

where  ^x(f)  is  the  characteristic  function  of  the  random  variable  X,  are  given  by 

K\(x)  =  M i(x)  =  ft, 

K2{x)  =  ft2(x)  =  a2, 

K,[x)  =  3(«(x))»  - 

Jf«(x)  =  !!«(x)  -  15/!j(x)(!,(x)  +  30((ij(x))3  =  yff*. 

*.(*)  =  M*)  -  28fi2(x)p9(x)  -  35(M*))3  +  420(MZ))V*(Z) 

-  6300*, (x))<  = 

iC10(*)  =  Mio(*)  -  45/x3(*)M*)  -  210^4(x)Mx) 

+  1260(Mz))3Mx)  +  3150MX)(MZ))2 


-  18900(/ij(x))3/i4(x)  +  22680(/i,(x))5  = 


145152 


K2j+i(x)  —  0,  j  —  1, 2, ... . 


.•.VA'.V.VA-V.V;VA  '.VA>\S\V.\\VA\V.V.VA'.SlV'.  \  ^  S.Y-A  «. 


i.il.it.rtrri'Cr 


The  first  few  relative  cumulants  of  X,  A„(x),  where  A*(x)  is  defined  as 


Ay(x)  =  K,,{x)(K2(x))-''7, 

are  given  by 

M*)  =  /*/*» 

Aa(*)  =  1, 

A4(x)  =  6/5, 

A«(*)  =  48/7, 

Aa(*)  =  432/5, 

Aio(x)  =  145152/77, 

• 

Ajj+l(x)  =  0,  j  as  1,2, —  (13) 

2.2.2  Edgeworth  series  expansions  for  the  distribution  of  the  mean  of  sam¬ 
ples  from  a  logistic  population 

Let  Xi,  X?, . . . ,  Xn  be  a  random  sample  of  size  n  from  a  logistic  population  L{n,  a2) 
with  mean  n  and  variance  a 2  whose  cdf  and  pdf  are  given  in  (1)  and  (2)  respectively. 
Define  a  standardized  mean  of  samples  of  size  n  from  £(/t,<r3),  Z  say,  as 

z  =  ££*-,■> 

=  (14) 

where  X=*l  Xi  is  the  sample  mean. 

Let  /n(s)  and  Fn(z)  denote  the  pdf  and  cdf  of  the  standardized  mean  of  samples 
of  size  n  from  L{p,a2).  Then  the  Edgeworth  series  expansions  of  the  /„(*)  and  Fn(z) 
are  given  symbolically  as 

Mz)  =  4>{z)  +  <t>{z)Y^P,{z)n~}li  +  0(n"(H'1)/3) 


Fn(z)  =  *(z)  -  *(z)  £  J>  (z)n-''2  +  0(n~^2) 
i-l 

respectively,  where  ^(z)  and  $(z)  are  the  standard  normal  pdf  and  cdf  respectively 
and  pj(z)  and  Pj(z)  are  polynomials  in  z,  which  are  obtained  up  to  v  —  10  in  Draper 
and  Tierney  (1973). 

Using pj(z)  and  Pj(z)  from  TABLE  II  of  Draper  and  Tierney  (1973)  and  the  relative 
cumulants  of  X  given  in  (13),  the  Edgeworth  series  expansions  of  the  /„(z)  and  F„(z) 
correct  to  order  n-*'/3,  v  =  4,6,8,  are  given  by 

/„(*,  i/  =  4) 

+ 1(^)(  y)*w  +  (f  )(|)J^.(*)]»-J} + 0(»-5/’), 

F.(»,  •'«■») 

+  K  jfXyUM*)  +  (f)(f)W’}  + 

f»(ZyV  =  6) 

=  /.(*,  "  -  4)  + 

+  0(f)(5)ifloW  +  (^)(|)3ff»(*)]n-3  +  O(n-W),  (15) 

F»(z,  i/  =  6) 

=  /•.(*,»  =  4)  -  ^(*)[(  jXx)*<*) 

+  (|f)(y)(|)*(*)  +  (^)(|)3ft.(*)]n-3  +  0(n-T/1).  (16) 


fn(z,l/  =  8) 

-  /.(*,,  =  6) +  «*)[(i)(i£p)iM*) 

+  (^y^)(!)J(f)ft.(^  +  (^y^Xfm.Mln-  +  0(n-“«) 


Fm(z,  t/  =  8) 


r.<*.  •'  =  «)-  «*)[(^)(^P)*(*) 


+ 

+ 


/495w432w 6.  .  s  (  /462w48^„  ,  > 

X^uW  +  Cyg-Ky)  Hxx^x) 

,105105  w6va/48,  „  ,  ,  .  ,2627625. ,6. 4 „  ,  xt  _A  a/,t 
(-jjpKj)5 *(— )ffu(*) +  (— ef " ’)(&) g*»(*)ln  +  °(n  ). 


where  Hj(x)'s  are  the  Hermite  polynomials  of  degree  j,  which  are  defined  by 

exp(-*a/2)  =  (-l)'ffi(x)  exp(— xa/2),  j  «  0, 1 . 

The  first  thirty  Hermite  polynomials  which  follow  the  recurrence  relation 

*  xHj— i  (x)  ~  (j  —  l)-flj-a(*)»  J  *  2, 3, ... , 

are  given  in  TABLE  HI  in  Draper  and  Tierney  (1973). 

Table  1,  Table  2  and  Table  3  contain  the  values  of  the  cdf  of  the  standardized 
mean  of  samples  of  size  n  from  a  logistic  population  with  mean  /j  and  variance  a3  for 
n  =  3, 10, 15  and  z  *  0.00(0.01)3.99  using  the  Edgeworth  series  expansion  correct  to 
order  n~9  given  in  (16).  Entries  in  the  tables  were  computed  by  using  double-precision 
arithmetic  on  a  Vax-11/780. 


2.2.3  Cornish-Fisher  series  expansions  for  the  quantiles  of  the  mean  of 
samples  from  a  logistic  population 

The  representation  of  a  quantile  of  one  distribution  in  terms  of  the  corresponding 
quantile  of  another  is  widely  used  as  a  technique  for  obtaining  approximations  for 
the  percentage  points.  One  of  the  most  popular  of  such  quantile  representations  was 
introduced  by  Cornish  and  Fisher  (1937)  and  later  reformulated  by  Fisher  and  Cornish 
(1960)  and  is  referred  to  as  the  Cornish-Fisher  expansion. 

By  means  of  formal  substitutions,  Taylor  expansions  and  identification  of  powers 
of  n,  the  Cornish-Fisher  expansion  of  a  quantile  z  of  F„(z)  which  is  the  cdf  of  the 


standardized  mean  of  samples  of  size  n  from  £(/*,  <r2),  in  terms  of  the  corresponding 
normal  quantile  y,  is  of  the  form 

*  =  V  +  E  Qj(y)n~i/7  +  OCn-^1)/2), 

i-l 

where  Qj(y)’s  are  polynomials  of  y,  which  are  obtained  up  to  i/  =  8  in  Draper  and 
Tierney  (1973). 

Using  Qj(y)  from  TABLE  VII  of  Draper  and  Tierney  (1973)  and  A,(x)  in  (13),  we 
obtain  the  Cornish-Fisher  series  expansions  for  the  quantiles  z  of  Fn(z)  up  to  order 
i/  =  4,6,8  as  follows: 

=  4)  -  »  +  t(jf)(|)(»>-3»)ln'1 

+  [(flXy)(/-10/  +  15s) 


,(v*6)  = 


+  (JXjJW  +  72  y3  -  87y)]n-a  +  0{n~*'7), 

1  432 

x(v  *  4)  +  l(g[)(~)(»7  -  21y8  +  105y3  -  105y) 

+  (^)(y)(!)(-15»?  +  255/  -  1035/  +  855s) 

+  (^)(|)’(2«/  -  3537/  + 12177/  -  8667»)]n‘ 
+  0(  »-»/») 


i  145152 

=  6)  +  l(,5f)(iyy-)(/  -  36/  +  378/  -  1260s’  +  945s) 
+  (^)(^)(l)(-21»’  +  530s7  -  5502s’  +  15330s’  -  9765s) 

12!  3  0 

+  (~)(y)2(-25y*  +  700y7  -  5850ys  +  15900y3  -  9945y) 

+  (^p)(|)3(y)(49V  -  12510y7  +  92370y5 
—  219810y3  +  121455y) 

+  (2g27628  )(|)4(— +  259848y7  -  1686906y5 
16!  5 


rwsw i 


'J  w, 


+  3539376y3  -  1743471y)]r»-4  +  0{n~9/7). 


Table  4  provides  the  quantiles  of  the  distribution  of  the  standardized  mean  of 
samples  from  the  logistic  population  for  sample  sizes  n  =  3(1)10(5)30  and  probability 
levels  a  =  0.900,0.950,0.975,0.990,0.995  using  the  Cornish-Fisher  series  expansions 
correct  to  order  v  =  4, 6  and  8  respectively.  Entries  of  the  table  were  calculated  by 
using  double-precision  arithmetic  on  a  Vax-1 1/780. 

2.2.4  Legitimacy  of  using  the  Edgeworth  and  Cornish-Fisher  series  expan¬ 
sions 

Noting  the  similarity  of  the  distribution  of  Z  in  (14),  the  standardized  mean  of  sam¬ 
ples  from  L(n,  a3),  to  the  normal  distribution  in  shape  except  its  relatively  longer 
tails,  George  and  Mudholkar  (1983)  compare  the  three  approximations,  that  is,  the 
standard  normal  distribution,  the  Edgeworth  series  expansion  correct  to  order  n-1  and 
the  standardized  Student’s  t  distribution  to  the  exact  distribution  of  Z.  In  using  the 
standardized  Student’s  t  distribution,  they  use  the  degree  of  freedom  £  =  5n  -f  4  which 
can  be  obtained  by  equating  the  coefficients  of  kurtosis.  They  show  that  the  Student’s 
t  distribution  provides  a  very  good  approximation. 

We  show  here  that  the  Edgeworth  and  Cornish-Fisher  series  expansions  correct  to 
order  n-3,  which  are  given  in  the  (16)  and  (17)  respectively,  are  far  better  approxima¬ 
tions  than  even  the  Student’s  t  distribution  in  George  and  Mudholkar  (1983). 

Table  5,  Table  6  and  Table  7  illustrate  the  quality  of  the  four  approximations.  In 
Table  5  the  four  approximations,  that  is,  the  standard  normal,  the  Edgeworth  series 
expansion  correct  to  n_1,  the  standardized  Student’s  t  and  the  Edgeworth  series  expan¬ 
sion  correct  to  order  n"3  are  compared  to  the  exact  distribution  given  in  Goel  (1975). 
The  approximation  using  the  Edgeworth  series  expansion  correct  to  order  n-3  appears 
to  be  superior  to  the  other  three  by  noting  that  the  maximum  error  is  about  0.0001  as 
shown  in  the  last  column  of  Table  5.  In  Table  6,  the  exact  values  of  the  distribution 
function  for  n  =  10  tabled  by  Goel  (1975)  are  compared  with  the  values  obtained  from 


a*  t<  *«  4«  it.  jK  |t  A».  A 


the  approximations  using  Student’ t  and  the  Comish-Fisher  series  expansion  correct 
to  order  n“s.  In  Table  7,  the  exact  quantiles  for  n  =  2, 3, . . . ,  15  tabled  by  Goel  (1975) 
are  compared  with  the  corresponding  approximations  obtained  from  the  Student’ t 
distribution  and  the  Comish-Fisher  series  expansion  correct  to  order  n-3.  From  these 
tables,  it  is  clear  that  for  sample  size  7  or  more  the  Edgeworth  series  expansion  correct 
to  order  n-3  provides  an  excellent  approximation  for  the  standardized  mean  of  sample* 
from  the  logistic  distribution.  Consequently,  we  will  use  the  Edgeworth  series  expan¬ 
sion  correct  to  order  n~s  as  an  approximation  to  the  distribution  of  the  standardized 
mean  of  the  samples  from  the  logistic  distribution  henceforth. 

2.3  A  single-stage  procedure  V\  for  selecting  the  population 
with  the  largest  mean  from  k  logistic  populations 

Bechhofer  (1954),  in  introducing  the  indifference  zone  formulation,  considered  the  prob¬ 
lem  of  ranking  means  of  normal  populations  with  a  common  known  variance.  Here  we 
consider  a  single-stage  procedure  using  an  indifference  zone  approach  for  selecting  the 
population  with  the  largest  mean  from  k  logistic  populations  when  they  have  a  common 
known  variance. 


2.3.1  Statement  of  the  problem 


Let  Ti,  •  ■  • ,  t*  be  k  independent  logistic  populations  with  unknown  means  m  and  a 
common  known  variance  <rJ.  Let  /ijjj  <  •  •  •  <  be  the  ranked  We  assume  that  it 
is  not  known  which  population  is  associated  with  /*[,-],  t  =  1, . . . ,  Jb.  We  further  assume 
that  a  population  is  characterized  by  its  population  mean  and  the  ‘best’  population  is 
the  one  having  the  largest  mean.  * 

Our  procedure  will  be  based  on  the  sample  means.  Let  Xi,  t  =  1, . . . ,  Jb,  denote 
the  means  of  independent  samples  of  size  n  from  ***  population.  The  sample  mean 
associated  with  population  having  population  mean  /*(,-]  will  be  denoted  by  that 
is,  the  expected  value  of  X(q  is  Let  X[*j  <  •  •  •  <  be  ranked  X,-.  If  X,  =  X; 
for  t  j,  due  to  the  limitations  of  the  measuring  instrument,  the  tied  means  should  be 


‘ranked  using  &  randomized  procedure  which  assigns  equal  probability  to  each  ordering. 

Assuming  that  the  goal  of  the  experimenter  is  to  select  the  best  among  the  k 
populations,  we  propose  a  single-stage  procedure  V .  as  follows. 

Procedure  V-\ ;  Take  n  observations  from  the  ***  population  for  each  *  =  1, . . . ,  jfc. 
Compute  the  k  sample  means  7i, . . .  ,X*.  Select  the  population  associated  with 
as  the  best  one. 

Defining  the  event  of  the  experimenter’s  selection  of  the  best  population  with  V\  as 
[CS\Vi],  the  probability  of  a  correct  selection  with  the  procedure  V\,  P{CS\Pi }  can 
be  written  as 

P{CS\V\}  =  Pjt[tht  best  population  is  selected 
=  PrfX^k)  >  maxi<j<*  Y(j)] 

=  -P/ifTt*)  >  X(j),j  =  l,...,k  — 1  ] 

=  Pg[(y/nl<T)(X^  -  py\)  <  (y/n/o)(X(k)  -  P[k]) 

+  (y/n/*)(m  -  /*«),  j  =  1, ...» At  —  1] 

■  n  Fn(z  +  (Vn/<r)(ti[k]  -  /iW))dF„(z),  (18) 

■/~00 

where  Fn(z)  is  the  cdf  of  the  standardized  mean  of  samples  from  a  logistic  population. 

For  the  fixed  values  of  the  m  and  o 3  the  probability  of  a  correct  selection  will 
depend  only  on  the  sample  size  n.  We  propose  to  design  the  experiment  in  such  a  way, 
that  is,  choose  the  n  in  such  a  way  that  under  specified  conditions  the  probability  of  a 
correct  selection  with  procedure  V\  will  be  equal  to  or  greater  than  some  preassigned 
value  P*. 

2.3.2  Determination  of  the  sample  sizes 

Now  for  the  problem  to  be  meaningful  P*  lies  between  l/k  and  1.  Since  the  true  values 
of  the  pi  are  not  known,  we  need  the  probability  of  a  correct  selection  to  be  at  least 
P *  whatever  be  the  values  of  the  /i,.  Thus  we  are  interested  in  the  configuration  of 
the  pi  for  which  the  probability  in  (18)  is  a  minimum.  Such  a  configuration  will  be 
called  a  least  favorable  configuration  (LFC).  It  is  obvious  that  the  LFC  is  given  by 


IH i]  =  •  •  ■  =  fi[k]'  But  unfortunately  the  minimum  value  of  the  probability  in  this  LFC 
case  is  l/k.  So  we  cannot  achieve  the  probability  requirement  whatever  be  the  sample 
size  unless  some  modification  is  made  in  the  probability  requirement. 

A  natural  modification  is  to  insist  on  the  minimum  probability  P*  of  selecting  the 
best  population  whenever  the  best  is  sufficiently  far  apart  from  the  next  best.  In 
other  words,  the  experimenter  specifies  a  positive  constant  6  and  requires  that  the 
probability  of  selecting  the  best  population  is  at  least  P*  whenever  (/q*j  —  M[*-i])  ^  S. 
The  specification  of  6  provides  a  partition  of  the  parameter  space  ft  where 

ft  =  {/*  =  (a*i»  *••,/**);  -00  <  Hi  <  oo,  i  =  1, . . . ,  fc}  (19) 

into  two  parts,  namely  ft(£)  where 

ft(*)  =  {£  €  ft)  (n[k)  -  ti[k- 1])  >  5}  (20) 

and  the  compliment  fte(6)  of  ft(£).  The  minimization  of  the  probability  of  selecting  the 
best  population  is  over  fl($).  For  an  obvious  reason,  ftc(£)  was  called  the  indifference 
zone  by  Bechhofer  (1954).  Subsequent  authors  have  termed  ft(6)  the  preference  zone. 

It  is  now  easy  to  see  that  the  LFC  in  ft(6)  is  given  by 

ft°(*)  =  V*  €  ft(*)l  MU1  =  Plk-i)  =  m  -  5}  (21) 

and  the  minimum  sample  size  required  is  the  smallest  integer  n  for  which 

inf  P,[CS\V,\  =  f  (Fn(.  +  (</ZI<T)6))'-'dF„(z)  >  P-.  (22) 

J-oo 

A  table  has  been  prepared  to  assist  the  experimenter  in  designing  the  experiments  to 
meet  the  above  goal.  Table  8  is  to  be  used  for  designing  experiments  involving  k  logistic 
populations  to  decide  which  has  the  largest  population  mean.  The  table  provides  the 
estimates  n  of  the  values  of  minimum  sample  size  n  associated  with  the  probability 
P m  =  0.75, 0.90, 0.95, 0.99  for  Jfc  =  2, 3, 4, 5, 10, 15,  and  S/a  =  0.1, 0.5, 1.0, 2.0, 4.0.  These 
were  computed  by  setting  the  left  hand  side  of  (22)  equal  to  P*.  The  minimum  sample 
size  n  can  be  obtained  by  n  =  [n  +  1]  where  [f]  denotes  the  greatest  integer  which 
is  less  than  t.  All  computations  were  carried  out  in  double-precision  arithmetic  on  a 
Vax-1 1/780. 


2.4  Subset  selection  procedures 

Gupta  (1956)  introduced  a  subset  selection  formulation  as  a  multiple  decision  problem, 
where  the  investigation  was  carried  out  for  the  case  of  normal  means.  Here  we  consider 
the  subset  selection  rules  for  selecting  the  population  with  the  largest  mean  from  A 
logistic  populations.  We  propose  two  subset  selection  rules  Ri  and  R2  based  on  sample 
means  and  sample  medians  respectively,  provide  tables  for  implementing  these  rules, 
consider  the  performance  characteristics  of  each  rule,  and  we  compare  the  two  rules  to 
each  other. 

2.4.1  Statement  of  the  problem 

Let  s’,-,  *  =  1, . . . ,  A,  be  k  independent  logistic  populations  with  unknown  means  m 
and  a  common  known  variance  a7.  Let  <  •  •  •  <  be  ranked  m  and  *•(,-)  be 
the  population  with  mean  We  assume  that  it  is  not  known  which  population  is 
associated  with  *  =  1, . . . ,  k.  We  further  assume  that  a  population  is  characterized 
by  its  population  mean  and  the  ‘best’  population  is  the  one  having  the  largest  mean, 
that  is,  X(fc). 

Let  j  1, . . .  jS,  denote  a  random  sample  from  x,,  i  —  1,  •  •  • ,  A,  where  the 
observations  within  and  between  populations  are  all  independent.  Let  X,-  and  Xu, 
*  =  1, . . . ,  A,  n  =  2/  —  1,  denote  the  means  and  medians  of  samples  of  size  n  from  x,- 
respectively.  The  sample  mean  and  the  sample  median  associated  with  the  population 
having  population  mean  fift  will  be  denoted  by  X(<)  and  X(i):h  *  =  1, ...»  A,  respectively. 
Let  X[i]  <  •  •  •  <  X(k]  and  JQijj  <  •  •  •  <  X[k]:i  be  ranked  X,  and  Xyj  respectively. 

The  goal  is  to  select  a  small  but  non-empty  subset  S  of  the  A  populations  so  that 
the  selected  subset  includes  with  a  high  probability  P*  the  ‘best’  population.  The  size 
of  the  selected  subset  5  is  an  integer- valued  random  variable  taking  on  values  1, . . . ,  A. 

Let  us  define  the  two  subset  selection  rules  Hi  and  Rj  based  on  the  sample  means 
and  sample  medians,  respectively,  as  follows; 

Rule  Ri  :  select  x,  iff 

X,  >  max  X>  -  hi<r/Vn,  i  =  1, . . . ,  A,  (23) 


*«*  «.■  «>»  «.*  «»»  <■■>  ««>  M  !>*  «.»  t»»7^*nv?y3r?«r«r 


jRu/e  Ri  :  select  Xi  iff 

Xu  >  max  Xjj  -  h2c/ y/n, *  =  l,...,fc,  (24) 

where  hi  and  hi  are  nonnegative  constants. 

By  defining  the  events  [C5|/2,-],  *  =  1,2,  as  selections  of  any  non-empty  subset  of 
k  populations  which  includes  the  best  population  using  Ri,i  =  1,2,  respectively,  it  is 
required  that  for  any  pi  €  ft 

Pn[CS\Ri]  >  P*,  (25) 

where  P*  €  (1/Jb,  1)  and  ft  is  the  parameter  space  given  by  (19). 

The  requirement  of  (25)  will  be  called  as  the  basic  probability  requirement  or  the 
P*-condition. 

Remark  2*1  Lorenzen  and  McDonald  (1981)  used  a  subset  selection  rule  R  based  on 
sample  medians  defined  as 

Rule  R  select  x ,•  iff 

JCu  ^  max  Xu  d  ,  d  ^  0,  %  —  1, . . . , !», 
iSiS*  1 

where  Xu  is  defined  as  above.  Here  we  use  Ri  instead  of  R  only  for  the  purpose 
of  comparing  R\  to  Ra  easily.  Basically  the  rule  Ri  is  the  same  as  Lorenzen  and 
McDonald's  rule  R. 

2.4.2  Probability  of  a  correct  selection 

•  Probability  of  a  correct  selection  for  the  means  rule  R\ 

Using  (23)  we  ran  write  the  probability  of  a  correct  selection  for  the  rule  R\  as 
follows.  For  ft  6  ft, 

Pficsw 

—  Pjj[X(*)  >  maxi<j<*  7;  —  h\<r / y/n  ,hi  >0] 

=s  Pi*(X(fc)  >  X(,-)  —  h\ojy/n  ,  V;  =  1, . . . ,  k  —  1  ] 


=  ??[{'/*! <?)(%{])  -  M[fl)  <  (\/n/<r)(X(fc)  -  /*[*])  +  hi 

+  -  Pm)>  Vj  8  1, . . . ,  k  -  1  ] 

=  n  *•(*  +  hi  +  {y/n/<r){n[k]  -  p\j\))dFn(z),  (26) 

J-co  j_x 

where  Fn(z)  is  the  cdf  of  the  standardized  mean  of  samples  from  a  logistic  distribution. 

We  see  from  (26)  that  the  infimnm  over  the  parameter  space  of  the  probability  of 
a  correct  selection  for  the  rule  R\  takes  place  when  pi  =  -  ••  =  Hk  and  so 

w  fycsijy = jT  {f„(z + hl)}i-'dF'(z).  (27) 

That  is,  the  LFC  for  the  rule  Ri  is  fi°  where 

Cl°  =  {/?  6  ftj  pi  =  •  •  •  =  Pfc  =  p)  (28) 

and  the  Pp[CS\R\]  in  the  LFC  does  not  depend  on  this  common  /*.  Hence,  if  we  choose 
hi  to  satisfy 

r  {F*(*  +  •P’  (29) 

J— oo 

then  we  have  determined  the  smallest  hi  for  which 

(30) 

It  should  be  noted  that  hi  =  hi(n,  k,  P*)  depends  on  n  as  well  as  k  and  P*  unlike 
the  normal  populations  problem. 

Table  9  and  Table  10  give  the  values  of  hi  =  hi(n,  k,  P*)  which  satisfy  (29)  for  n  = 
1(1)10,  k  as  2(1)10  and  1 P*  as  0.75,0.90,0.95,0.975,0.99.  We  use  the  Edgeworth  series 
expansions  correct  to  order  n~3  for  F„(z)  and  /„(z)  ,  the  Gauss-Hermite  quadrature 
algorithm  with  sixty  nodes  for  the  evaluation  of  the  integrals  and  the  modified  regular 
falsi  algorithm  for  solving  the  non-linear  equation.  The  entries  were  calculated  by  using 
double-precision  arithmetic  on  a  Vax-1 1/780. 

•  Probability  of  a  correct  selection  for  the  medians  rule  R? 

Let  Zb i, . . . ,  Zi:n  be  a  random  sample  of  size  n,  where  n  is  an  odd  integer,  drawn 
from  the  standard  logistic  population.  Then  it  is  well  known  that  the  sample 


median,  denoted  by  Zu,  {n  =  21  —  1),  has  the  pdf 

S.W  -  ^[F(*)]w[l  -  FWf» 

and  the  cdf 

Gn(z)  =  I{F(z);ltl},  (31) 

where  f(z)  and  F(z)  are  the  pdf  and  cdf  of  the  standard  logistic  population  given  by 
(3)  and  (4)  respectively  and  /{y;  o,  6}  is  the  incomplete  beta  function  with  parameters 
a  and  6,  which  is  given  by 

~ ' “r'dw ■  <32) 

Now  the  probability  of  a  correct  selection  for  the  medians  rule  Ri  can  be  written 
as  follows.  For  jt  €  Cl 


Pjt[CS\R7] 


—  Pp{X(k)j  >  maxi<,<*  Xtf.j  —  h2tr/y/n  ,  h2  >  0] 
=  P?[X(k)J  >  X(j)j  -  h2<r/y/n  ,  Vj  =  1, . . . ,  k  -  1  ] 
*  Pp[Z(jyj  ^  Z(hy  '•  h/y/n 

+  (m  -  Y? »  i,  . . . ,  k  -  i  ] 


-  X 


OO  *-l 

II  Gn(t  +  h2/y/n  +  (^[*j  -  fiyi)/<r)dGn(t), 
1 


(33) 


where  Gn  is  given  by  (31). 

We  see  that  the  infitnum  over  Cl  of  the  probability  of  a  correct  selection  for  the  rule 
Ri  takes  place  when  p\  =  •  •  •  =  Hk  —  p  and  so 


Pf[CS\Ra]  =  jT  {Gw(<  +  h2/W}k-ldGn(t ).  (34) 

Hence,  if  we  choose  h2  to  satisfy 


r{Gn(t  +  h2/W}k-ldGn(t)  =P\  (35) 

«/— OO 

then  we  have  determined  the  smallest  h2  for  which 


>  P*. 


(36) 


The  values  of  ha/y/n  =  A3(n,i,P*)/v/n  which  satisfy  (35)  for  n  =  1(2)19,  k  = 
2(1)10  and  P*  =  0.75,0.90,0.95,0.975,0.99  were  given  in  TABLE  I  of  Lorenzen  and 
McDonald  (1981). 

2.4.3  Performance  characteristics 

In  this  section  some  performance  characteristics  of  the  subset  selection  procedures  R\ 
and  R-2  are  studied. 

Let  *  =  1,...,&,  j  =  1,2,  denote  the  probabilities  of  including  in  the 

subset  the  population  *(,-),  that  is,  the  Ith  ranked  population,  using  the  rule  Rj  for  the 
/?  6  fl,  then  for  i  =  1, . . . ,  k, 

P*[x<.->|&]  =  S  max1£i<»  X,  -  httr/y/n  ,  kt  >  0) 

«  jH  n  W  +  +  (v^Mwi  -  (37) 

where  Fn(t)  is  the  cdf  of  the  standardized  mean  of  samples  of  size  n  from  a  logistic 
distribution  and 

P?[T(i)\Ri\  =  ^  naaxi<j<*  -  h2cr/ y/n  ,  >  0] 

=  /  II  +  W +  (^M  ~  AHj])/ <r)dGn(t),  (38) 

■/-°°  ,-i 

where  Gn(t),  given  in  (31),  is  the  cdf  of  the  median  of  samples  of  size  n,  where  n  is  an 
odd  integer,  from  the  standard  logistic  population. 

It  is  easy  to  see  that  the  expected  sizes  of  the  selected  subset  using  the  rule  Rj  for 
€  0,  denoted  by  j  =  1,2,  are  given  as  follows: 

(39) 

1=1 

Consistent  with  the  basic  probability  requirement,  we  would  like  the  size  of  the  selected 
subset  to  be  small. 


The  expected  numbers  of  non-best  populations  selected  by  rule  Rj  for  p  €  ft, 
denoted  by  Eg[5*|Jij],  j  =  1,2,  are  defined  as 

£4m)  =  E  (40) 

and  also  we  wovdd  like  the  value  of  the  to  be  small. 

In  using  the  rule  Rj,  j  =  1,2,  the  ranks  of  the  selected  populations  are  random 
variables  and  one  may  want  to  evaluate  the  expected  sum  of  ranks  of  the  selected 
populations.  Let  the  population  with  parameter  /ipj  be  assigned  rank  *,  *  =  1, . . . ,  Jb. 
Then  the  expected  sums  of  ranks  of  the  selected  populations  by  rule  Rj  for  /T  €  fi, 
denoted  by  Ep[SR\Rj],  j  =  1,2,  are 

£WSB|vy  =  £u>j[x(j)|jy.  <4i) 

«Bl 

For  given  jt  €  ft,  the  expected  proportions  of  the  selected  populations  by  the  rule 
Rj,  denoted  by  Ep[P\Rj\,  j  —  1,2,  are  given  by 

EdP\Ri]  =  JEa[S|iy/*.  (42) 


Since  the  values  of  j  —  1,2,  depend  on  /I  €  ft,  we  consider  them 

for  the  two  special  cases,  namely  the  equally  spaced  configuration  and  the  slippage 
configuration. 

First,  for  the  equally  spaced  configuration,  we  assume  that  the  unknown  means  of 
the  k  populations  are  fi,ii+8<7,  •  •  • ,  fi  +  (k— \)8<r  which  have  ranks  1, . . . ,  k  respectively. 
Then  the  probabilities  of  including  in  the  subset  the  population  T(,-)  using  the  rule  Rj 
for  this  configuration,  denoted  by  /%[*■(«) | f?,],  j  =  1,2,  are  given  by 

P«[*{i)\Ri]  -  r  n  **»(*  +  hr  +  (i-  j)8y/n)dFn(t)  (43) 

—  “OO  jmi 


and 


P«MR*}  —  j  II  Gn(t  +  W y/n  +  (*  —  j)£)d(7n(t) 

OO  4—1 


(44) 


respectively. 

.  Next,  for  the  slippage  configuration,  we  assume  that  the  unknown  means  of  k 
populations  are  p^  =  p,  j  =  1, . . . ,  fc  —  1,  and  /q*]  »  p  +  8a  for  6  >  0.  Then  the 
probabilities  of  including  in  the  selected  subset  the  population  *■(,-)  using  the  rule  Rj, 
denoted  by  P*p[*(i\\Rj\,  j  =  1,2,  are  given  by 

=  /  {-fn(t  +  hi)}*-2Fn(t  +  hi  —  8y/n)dFn(t)  ,  *  =  1, . . . ,  fc  —  1, 

-/-OO 

P»[*w\Ri]  =  r  {Fn(t  +  hx  +  6^}k-'dFn(t), 

«/— o o 

«  /-  {G"(*  +  ^/V^)}k’2Gn(t  +  hi/ Sr T  -  5)dGn(t),  *  =  1, . . . ,  k  -  1, 

J—oo 

and 

=  r  <G»(‘  +  WVS  +  «)}‘-VGn((). 

•/—  oo 

Now  we  can  compute  the  performance  characteristics  £p[5|.Rj],  ^[5*|f2,],  Ep[SR\Rj] 
and  Eff[P\Rj]  for  two  special  configurations  by  substituting  P«l*wm  ^•p[7r(«)|-Rj] 
for  Pjj[x(j)|i?,]  in  (39),  (40),  (41)  and  (42)  respectively. 

Table  11  and  Table  12  give  the  values  of  the  performance  characteristics  of  the 
means  rule  R\  and  Table  13  and  Table  14  give  the  same  values  of  the  medians  rule  jRj 
for  the  equally  spaced  and  the  slippage  configurations  respectively  for  the  given  values 
oik  =  2, 3, 4, 5, 10,  P*  =  0.90,  n  =  3  and  y/n6  =  0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 4.0, 5.0. 

For  instance,  from  Table  11  for  P*  =  0.90,  n  =  3,  k  =  5  and  Sy/n  =  1.5,  the 
probability  of  a  correct  selection  by  using  the  means  rule  R\  is  0.997.  The  expected 
size  of  the  selected  subset  is  2.208  and  the  expected  number  of  the  non-best  populations 
selected  is  1.211.  The  expected  sum  of  the  ranks  in  the  selected  subset  is  9.330  and  the 
expected  proportion  of  the  selected  population  is  0.442.  It  should  be  noted  that  the 
expected  sum  of  ranks  by  itself  is  not  a  good  criterion  of  the  performance  of  a  selection 


rule.  It  should  be  looked  at  together  with  the  expected  values  of  5  and  5"  to  ttiaW*  a 
more  meaningful  performance  characteristic. 

The  entries  in  all  tables  were  calculated  by  using  double-precision  arithmetic  on  a 
Vax-11/780. 

Note  that,  for  both  rules  Pi  and  Pj  and  for  the  fixed  values  of  P*,  n,  k  and 
t  =  l,2,...,Jb  —  l(fc),  the  probability  of  selecting  the  t**  ranked  population  in  the 
slippage  configuration  can  be  proved  to  be  monotonically  decreasing(  increasing)  with 
6y/n  and  hence  monotonically  decreasing(increasing)  with  6  and  n  separately.  Also  for 
i  =  l(Jfc),  the  probability  of  selecting  the  Ith  ranked  population  in  the  equally  spaced 
configuration  can  be  proved  to  be  monotonically  dec reasing( increasing)  with  6y/n.  A 
look  at  the  table  values  seems  to  indicate  that,  for  both  rules  R\  and  R-i  and  for  the 
fixed  values  of  P~,  n,  k  and  t  =  2, . . . ,  Jfc  —  1,  the  probability  of  selecting  the  ***  ranked 
population  in  the  equally  spaced  configuration  is  also  monotonically  decreasing  with 
Sy/n.  For  fixed  P*,  *,  n  and  Sy/n,  the  probability  of  selecting  the  Ith  ranked  population 
is  monotonically  decreasing  with  the  values  of  k  for  all  *,  i  =  1, . . . ,  k. 

2.4.4  Comparison  between  the  means  rule  Pi  and  the  medians  rule  P2 

In  this  section  we  compare  the  efficiency  of  the  means  rile  Pi  to  that  of  the  medians 
rule  R2.  Lorenzen  and  McDonald  (1981)  have  studied  the  problem  of  large  sample 
comparisons  between  the  two  rules  R\  and  R2.  They  computed  the  asymptotic  relative 
efficiency  (ARE)  of  Pi  relative  to  R2  defined  by,  for  e  €  (0, 1)  and  $  e  D, 

where  N^,  j  =  1,2,  are  the  numbers  of  observations  needed  so  that 

inf  Pt[CS\Rj]  =  P* 

and 

E,[s-|jy  =  € 


by  assuming  a  slippage  configuration,  that  is, 


Their  value  of  the  ARE(Ri,  R2;  jt)  is  0.822.  Thus,  under  a  slippage  configuration, 
asymptotically  the  means  procedure  requires  about  82%  of  the  sample  size  required  by 
the  medians  rule  to  achieve  the  same  expected  number  of  non-best  populations  in  the 
selected  subset. 

Now  we  consider  the  small  sample  comparisons  between  the  rules  R\  and  Ri  by 
using  the  performance  characteristics  of  each  rule  given  in  the  previous  section.  In 
Table  15,  we  compute  the  values  of  the  probability  of  a  correct  selection  (P(CS')),  the 
expected  sizes  of  the  selected  subset  (E(S)),  the  expected  numbers  of  non-best  popula¬ 
tions  in  the  selected  subset  (E(S*)),  the  expected  sums  of  the  ranks  of  the  populations 
selected  in  the  subset  ( E(SR ))  and  the  expected  proportions  of  the  populations  selected 
in  the  subset  ( E(P ))  for  each  rule  Ri  and  Ri  and  the  ratio  of  those  values  of  the  rules 
when  the  unknown  means  are  equally  spaced  for  the  selected  values  of  P"  =  0.90, 0.95, 
n  =  3,5,  k  =  4,  and  8y/n  —  1.5, 3.0.  The  same  values  for  the  slippage  configurations 
are  given  in  Table  16. 

For  both  of  the  configurations; 

•  Since  P(CS\Ri)/P(CS\Ri)  >  0.991  for  all  cases,  the  values  of  P(CS)'a  are  not 
much  different  for  all  cases. 


•  Since,  for  example,  P(5|i2i)/P(5|i23)  <  1  for  all  cases,  the  values  of  E(S),  E(Sm), 
E(SR)  and  E(P)  for  the  rule  R\  are  less  than  or  equal  to  the  same  values  for 
the  rule  Ri  for  all  cases. 


•  The  values  of  the  ratio  of  the  rules  R\  and  Ri  for  all  characteristics  are  decreasing 
as  the  values  of  n  are  increasing. 


Hence,  as  expected,  the  means  rule  R\  is  definitely  better  than  the  medians  rule  Ri 
in  the  sense  of  their  performance  characteristics  and  the  performance  of  the  rule 
relative  to  the  rule  Ri  improves  as  sample  sizes  are  increasing  for  both  configurations. 
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Table  1:  Approximate  cdf  of  the  standardized  mean  of  samples  from  a  logistic  popula¬ 
tion:  Sample  size  n  =  3. 
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09777 

09783 

09786 

0.9791 

09796 

0.9799 

3.10 

0.9004 

asoos 

09613 

09816 

09630 

09833 

09837 

09831 

09834 

0.9838 

2J0 

0.9041 

a 9845 

09848 

09851 

09854 

0.9867 

09861 

09864 

09868 

0.9869 

3.30 

0.9073 

0.9075 

09878 

09880 

09883 

09685 

09688 

09890 

0.9693 

0.9896 

3.40 

0.9097 

0.9900 

09903 

09904 

0.9906 

09908 

09910 

0.9912 

0.9914 

09916 

2JK) 

0.9918 

a9930 

09933 

09933 

09935 

09937 

09938 

09930 

09931 

09933 

3.00 

0.9838 

aooso 

09937 

0.9939 

0.9940 

09943 

09943 

09944 

09946 

09947 

2.70 

0.9948 

a 9949 

09960 

09961 

09953 

0.9954 

09965 

0.9956 

09967 

0.9968 

seo 

09969 

0.9960 

09961 

0.9963 

0.9963 

09963 

0.9964 

0.9965 

09966 

09967 

2.90 

0.9908 

asoos 

09969 

09970 

09971 

09971 

0.9973 

09973 

0.9973 

0.9974 

3.00 

0.9975 

a 9975 

09976 

09976 

09977 

09977 

0.9978 

0.9979 

09979 

0.9960 

3.10 

0.9900 

0.9901 

09981 

09981 

09983 

09963 

09983 

09963 

09964 

0.9004 

030 

aooe* 

assss 

09985 

0.9986 

0.9986 

09986 

09987 

09967 

0.9967 

0.9968 

3J0 

0.9908 

asoos 

09989 

0.9989 

09989 

0.9989 

09690 

0.9990 

0.9990 

0.9990 

3.40 

a9991 

0.9991 

09991 

09991 

0.9993 

09993 

0.9993 

0.9993 

0.9993 

0.9993 

OSO 

0.9993 

a9993 

09993 

09993 

0.9993 

0.9994 

0.9994 

0.9994 

0.9994 

0.9994 

3.00 

0.9994 

0.9996 

09996 

0.9996 

0.9995 

09996 

0.9996 

0.9996 

0.9996 

0.9996 

3.70 

0.9996 

0.9996 

09996 

09996 

0.9996 

09996 

0.9996 

0.9996 

0.9997 

0.9997 

sjo 

0.9997 

0.9907 

0.9997 

0.9997 

0.9997 

0.9997 

0.9997 

0.9997 

0.9997 

0.9997 

3.90 

0.9997 

0.9997 

0.9998 

0.9996 

09996 

0.9998 

0.9996 

0.9998 

0.9998 

0.9998 
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Table  2:  Approximate  cdf  of  the  standardized  mean  of  samples  from  a  logistic  popula¬ 
tion:  Sample  size  n  =  10. 


s 

.00 

.01 

.03 

.03 

.04 

.06 

.06 

.07 

.08 

.09 

a 

OMJOO 

08040 

08081 

08131 

05183 

05303 

05343 

0.5283 

05334 

0.5364 

aio 

0,5404 

Q4444 

08488 

05835 

Q4565 

05605 

05645 

05685 

05735 

0.5764 

0l3O 

Q43Q4 

08844 

05933 

05983 

06001 

06040 

06079 

0.6118 

0.6157 

030 

a0l98 

0.8334 

08373 

08311 

08349 

06387 

08435 

0.6463 

06500 

0.6538 

040 

0.0875 

08613 

08849 

06886 

06733 

0.6759 

0.6796 

0.6831 

0.6867 

0.6903 

050 

a 0038 

0.6974 

07009 

07044 

07079 

07113 

07148 

07183 

07316 

0.7250 

a  oo 

0.7383 

37317 

07360 

07383 

07416 

07448 

07480 

07513 

0.7544 

0.7576 

070 

37807 

0.7838 

07889 

07700 

07730 

07761 

07791 

0.7830 

0.7850 

0.7879 

080 

37908 

0.7937 

07965 

07994 

08033 

08060 

08077 

08104 

0.8133 

0.8158 

080 

08188 

0.8311 

08337 

08363 

0.8289 

08314 

0.8339 

0.8364 

0.8388 

0.8413 

1.00 

08437 

08461 

08484 

08508 

08531 

08554 

08576 

08598 

0.8631 

0.8643 

1.10 

08684 

asoss 

08707 

0.8737 

08748 

08768 

08789 

08808 

08838 

0.8848 

1.20 

08887 

04488 

08004 

08933 

08941 

08969 

08977 

0.8995 

09013 

0.9039 

1.30 

0.9048 

09083 

09079 

09096 

09111 

0.9137 

09143 

09158 

09173 

0.9188 

1.40 

08303 

0.9317 

09333 

09348 

0.9380 

09373 

09387 

09300 

0.9313 

0.9336 

1J0 

0.9339 

09351 

09384 

09376 

08388 

09400 

09411 

0.9433 

09434 

09445 

1.00 

39458 

09488 

09477 

09487 

09497 

09607 

09617 

0.9637 

09637 

0.9546 

1.70 

0.9555 

0.9684 

09673 

09683 

09690 

09699 

09607 

09615 

0.9633 

0.9631 

1.80 

08839 

0.9647 

09864 

09861 

09689 

09676 

09883 

09690 

09696 

0.9703 

1.90 

0.9709 

0.9716 

09733 

09738 

09734 

09740 

09746 

09751 

0.9757 

0.9763 

3.00 

08787 

0.9773 

09778 

09783 

09788 

09793 

09797 

09803 

0.9806 

0.9811 

3.10 

08815 

09830 

09834 

09838 

09833 

09836 

09840 

09843 

09847 

0.9851 

3.30 

0.9854 

0.9858 

09881 

09864 

09888 

09871 

0.9874 

09877 

09880 

0.9683 

3JO 

39888 

0.9889 

09891 

09894 

09897 

09899 

0.9903 

09904 

0.9906 

0.9909 

3.40 

0.9911 

09913 

09915 

09918 

0.9930 

09933 

09934 

0.9936 

0.9938 

0.9939 

3 JO 

0.9931 

09033 

09936 

09936 

09938 

09940 

09941 

0.9943 

09944 

0.9946 

3.00 

39947 

09949 

09960 

09961 

09963 

0.9964 

0.9955 

0.9956 

0.9967 

0.9959 

3.70 

0.9900 

09961 

09983 

0.9963 

0.9984 

0.9965 

09966 

0.9967 

0.9968 

0.9968 

3.80 

0.9989 

09970 

09971 

0.9973 

09973 

0.9973 

0.9974 

0.9975 

0.9976 

0.9976 

3.90 

0.9977 

0.9978 

0.9978 

0.9979 

0.9979 

09980 

0.9961 

0.9981 

0.9983 

0.9983 

3.00 

0.9983 

09983 

09984 

09964 

0.9985 

09985 

0.9985 

0.9966 

09986 

0.9987 

3.10* 

0.9987 

09987 

09988 

0.9988 

0.9989 

09969 

09969 

0.9989 

0.9990 

0.9990 

3.30 

39990 

09991 

09991 

09991 

0.9991 

09993 

09993 

09993 

0.9993 

0.9993 

3J0 

0.9993 

09993 

09993 

09994 

0.9994 

0.9994 

0.9994 

0.9994 

0.9994 

0.9995 

3.40 

0.9996 

09906 

0.9996 

0.9996 

0.9995 

0.9996 

0.9996 

0.9996 

0.9996 

0.9996 

3.80 

0.9996 

09996 

09996 

09907 

09997 

0.9997 

0.9997 

0.9997 

0.9997 

0.9997 

3.00 

0.9997 

09997 

09997 

09997 

0.9996 

0.9996 

0.9996 

0.9996 

0.9998 

0.9998 

3.70 

0.9998 

0.9998 

09998 

0.9998 

09996 

0.9998 

0.9996 

0.9996 

0.9998 

0.9999 

380 

0.9999 

0.9909 

0.9999 

0.9999 

0.9999 

09999 

09999 

0.9999 

0.9999 

0.9999 

390 

08999 

0.9999 

0.9999 

0.9999 

09999 

0.9999 

0.9999 

0.9999 

09999 

0.9999 

mm 


Table  3:  Approximate  cdf  of  the  standardized  mean  of  samples  from  a  logistic  popula¬ 
tion:  Sample  size  n  =  15. 


.04 


0.5181 

0.5563 

04)957 

0.8343 

0.6715 


07071 

0.7407 

0.7732 

0.8013 

0.8281 


ass  23 
0.8742 
0.8938 
0.9107 
0.9257 


0.9386 

0.9497 

0.9691 

0.9689 

0.9735 


0.9789 

09634 

0.9870 


0.9999 

0.9999 


0.9994 

0.9998 


0.9997 


0.9999 

0.9999 

0.9999 


.06 


05201 

05802 

0J996 

06381 

08751 


07106 

07439 

07752 

08041 

08306 


0.8548 

0.8763 

0.8964 

09123 

0.9271 


0.9607 

09699 

09677 

09741 


09794 

0.9638 

09873 

09901 

0.9934 


09942 

09968 

0.9967 

09975 

0.9981 


09966 

09990 

09993 

0.9996 

0.9998 


0.9997 

0.9996 

0.9999 

0.9999 

0.9999 


05242 

0.5282 

05322 

05642 

0.5682 

06035 

0.6074 

06419 

06456 

06494 

0.6788 

06824 

07140 

07174 

07208 

07472 

07504 

07535 

07782 

07812 

07841 

0.8123 

0.8331 

0.8356 

0.8381 

0.8591 

08782 

08802 

mi 

0.8972 

08990 

09007 

0.9139 

0.9154 

09170 

0.9284 

0.9298 

09311 

09410 

0.9431 

09432 

09617 

09627 

0.9636 

09607 

0.9616 

0.9624 

09684 

0.9691 

09697 

0.9747 

0.9753 

0.9758 

0.9943 

0.9967 

0.9967 

0.9976 

0.9982 


0.9967 

0.9990 

0.9993 

0.9995 

0.9996 


0.9997 


09879 

0.9906 

0.9928 


0.9945 

0.9968 

0.9968 

09976 


0.9987 

0.9990 

0.9993 

0.9995 

0.9996 


0.9997 

0.9998 

0.9999 

0.9999 

0.9999 


0.9909 

0.9930 


0.9946 

0.9959 

0.9969 

09977 

0.9983 


0.9987 

0.9991 

0.9993 

0.9995 

0.9996 


0.9997 

0.9996 

0.9999 

0.9999 

0.9999 


0.6761 

0.6152 


0.7241 

07567 

0.7870 

0.8150 

0.8405 


0.8636 

0.8842 

0.9024 

0.9185 

0.9324 


0.9646 

0.9632 

0.9704 

0.9764 


0.9613 

09853 

0.9685 

0.9911 

0.9932 


0.9948 

0.9960 

0.9970 

0.9978' 

0.9983 


0.9988 

0.9991 

0.9993 

0.9995 

0.9997 


0.9998 

09998 

0.9999 

0.9999 

0.9999 


Table  4:  Approximate  quantiles  of  the  standardized  mean  of  samples  from  a  logistic 
population  using  Comish-Fisher  series  expansion. 


Table  5:  A  comparison  of  four  approximations  for  cdf  of  standardized  mean  of  samples 
of  size  3  from  logistic  populations. 


*  1  *3(*) 

m*)-*(*) 

W*)  -  G3(x) 

m*)  -  T*(x) 

f5(*)  ~  G'3(x) 

0.05 

0.5209 

0.0010 

0.0000 

0.0001 

0.0000 

0.15 

0.5625 

0.0029 

0.0000 

0.0003 

0.0000 

0.25 

0.6033 

0.0046 

0.0008 

0.0005 

0.0000 

0.45 

0.6809 

0.0073 

-0.0017 

0.0007 

0.0001 

0.65 

0.7506 

0.0084 

-0.0006 

0.0007 

0.0000 

0.85 

0.8106 

0.0083 

-0.0007 

0.0007 

0.0000 

1.00 

0.8486 

0.0073 

-0.0008 

0.0004 

0.0000 

1.20 

0.8903 

0.0054 

-0.0007 

0.0002 

0.0000 

1.45 

0.9291 

0.0026 

-0.0004 

0.0000 

0.0000 

1.75 

0.9598 

-0.0001 

0.0001 

-0.0002 

0.0000 

2.50 

0.9918 

•  -0.0020 

0.0004 

0.0002 

0.0000 

3.00 

0.9975 

-0.0012 

0.0001 

0.0001 

0.0000 

#3  (*)  =  cdf  of  the  standardized  mean  of  3  iid  logistic  r.v.’s. 
♦(*)  =  cdf  of  the  standard  normal  r.v. 

G3(x)  =  Edgeworth  series  expansion  correct  to  order  n_1 
Ts(x)  =  cdf  of  the  standardized  Student’s  t  r.v.’s  with  19  d.f. 
G'3(x)  =  Edgeworth  series  expansion  correct  to  order  n~3 
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Table  7:  Quantiles  of  the  standardized  mean  of  logistic  variates. 


Sample  rim 

ProbahsHty(ar) 

B 

0.900 

a  950 

0.975 

0.990 

0.996 

1.2452 

1.6306 

1.8757 

2.4298 

2.7560 

2 

1.2432 

1.6340 

1.9961 

2.4500 

2.7718 

1.2425 

1.6326 

1.9934 

2.4450 

2.7755 

1.2617 

1.6381 

1.9760 

2-3861 

2.6778 

4 

1.2612 

1.6393 

1.9790 

2.2906 

2.6821 

1.2612 

1.6392 

1.9789 

2.3905 

2.6624 

12564 

1.6395 

1.9734 

2.3756 

2.6597 

S 

1.2651 

1.6403 

1.9755 

2.3786 

2.6642 

1.2651 

1.6403 

1.9754 

2.3786 

2.6625 

1-2697 

1.6411 

1.9750 

2.3628 

2.6376 

7 

1.2696 

1.6416 

1.9712 

2.3644 

2.6390 

1.2696 

1.6416 

1.9712 

2.3644 

2.6390 

1.2731 

1.6423 

1.9674 

2.3526 

2.6201 

10 

1.2731 

1.6425 

1.9680 

2.3534 

2.6206 

1.2731 

1.6425 

1.9680 

2.3534 

2.6208 

1.2745 

1.6427 

1.9662 

2.3484 

2.6131 

12 

1.2745 

1.6426 

1.9667 

2.3491 

2.6135 

1.2745 

1.6429 

1.9667 

2.3490 

2.6135 

1.2759 

1.6432 

1.9651 

2.3443 

2.6059 

IS 

1.2755 

1.6433 

1.9654 

2.3446 

2.6062 

1.2759 

1.6433 

1.9654 

2.3446 

2.6062 

Top  afaaranl  in  each  oa&  repeal anti  Student's  t  approximation 
Middle  ele mailt  is  each  call  repraaanta  the  exact  percentage  paint. 

Bottom  aUmaui  in  each  call  ritpraaanta  the  Comiah-Fiahar  aeriea  approximation  (n~* ). 


4.00 

0.06 

0.19 

0.33 

a75 

2.00 

0.20 

0.77 

134 

2.83 

1.00 

an 

3.22 

5.40 

10.94 

0.50 

3.51 

13.07 

21.63 

43.43 

aio 

oaso 

328.43 

541.09 

108236 

4.00 

ai3 

0.30 

a47 

0.93 

3.00 

0.46 

1.31 

1.85 

3.42 

1.00 

1.98 

4.93 

7.36 

13.25 

aso 

an 

19.86 

39.40 

52.49 

aio 

306.48 

497.34 

734.49 

1308.66 

4.00 

0.16 

0.37 

0.55 

1.05 

3.00 

0.68 

1.49 

2.17 

3.78 

1.00 

3.74 

5.99 

ass 

14.61 

0u50 

11.33 

24.03 

34.06 

57.86 

aio 

282.89 

601.00 

850.48 

1441.89 

4.00 

a20 

0.43 

aei 

1.13 

3.00 

0.80 

1.69 

2.39 

4.04 

1.00 

3.34 

are 

9.40 

15.58 

0.50 

13.56 

37.04 

37.40 

61.67 

aio 

340.80 

675.85 

933.47 

1536.53 

4.00 

0.33 

0.59 

an 

1.32 

3.00 

1.38 

2.38 

3.03 

4.76 

1.00 

5.12 

a96 

11.80 

18.29 

0.50 

30.50 

35.66 

46.86 

73.38 

0.10 

512.43 

889.88 

1168.58 

1803.84 

4.00 

0.39 

0.68 

0.93 

1.44 

2.00 

1.56 

2.n 

3.39 

5.15 

1.00 

6.14 

10.17 

13.12 

19.77 

aso 

24.41 

40.39 

52.01 

78.17 

aio 

609.06 

1007.16 

1296.45 

1946.93 

Vv  - "r-  *  -  ^ 


Table  11:  Performance  characteristics  of  the  means  rule  Ri  under  the  equally 
spaced  configuration. 


— FT 

ago,  n: 

-3 

- 1 

* 

W*  1 

_ 

03^ 

1.0 

nr 

28 

nr 

4.0 

nr 

3 

1 

0.834 

0.717 

0.584 

0.440 

(UOS 

0.192 

a069 

0.013 

3 

a»48 

0.975 

0.989 

0.996 

0.998 

0.999 

1.000 

1.000 

1.773 

1.893 

1873 

1.436 

1.303 

1.193 

1.068 

1.013 

B(S*) 

a834 

0717 

0884 

a440 

0.30ft 

0.192 

0.069 

0.013 

B(SB) 

3.730 

3.687 

3.583 

2.431 

2801 

2.191 

2.058 

2.013 

=1 

P) 

0.886 

0.548 

0.788 

0.718 

0.851 

0-596 

0829 

0.506 

1 

0.754 

0811 

0.254 

aoss 

0.022 

0.004 

0.000 

0.000 

3 

3 

0876 

0.806 

0.697 

0883 

a418 

0.288 

aioi 

0.025 

3 

0.983 

0.986 

0.996 

a998 

0.999 

1.000 

1.000 

1.000 

B 

w~ 

3.593 

3803 

1.945 

1.648 

1.439 

1.289 

1.101 

1.025 

E(S*) 

1.830 

1816 

0961 

aeso 

a440 

a289 

0.101 

0.025 

E(SR) 

5.394 

5.080 

4.631 

4806 

3858 

3874 

3.302 

3.050 

E(P) 

0.884 

0.767 

0.848 

0.549 

0.480 

0.430 

0.387 

0843 

1 

0.681 

0.376 

0.051 

0.004 

aooo 

aooo 

0.000 

0.000 

4 

3 

0.799 

0880 

0813 

aii8 

aasi 

0.008 

aooo 

aooo 

3 

0.903 

0847 

0.753 

a  638 

a  454 

0844 

ai32 

0.038 

4 

0.973 

0.990 

0.998 

a  990 

1.000 

1.000 

1.000 

1.000 

E(s) 

3^35 

3.693 

3.112 

1.747 

1815 

1850 

1.132 

1.038 

E(S«) 

3.383 

1.703 

1.116 

a749 

asio 

0850 

0.132 

0.038 

E(SR) 

8855 

7.938 

6.917 

aii4 

6813 

5.043 

4897 

4.107 

-JjPL. 

0.834 

0.873 

0838 

0.437 

0879 

a337 

0.283 

0.359 

1 

0847 

0.103 

0.005 

aooo 

aooo 

0.000 

aooo 

0.000 

3 

0702 

0818 

0.085 

0.008 

aooo 

aooo 

aooo 

aooo 

s 

3 

0839 

0.636 

0.356 

ai43 

ao40 

aoos 

aooo 

0.000 

4 

0830 

0.873 

0.788 

a668 

0.529 

0.358 

ai57 

0.045 

5 

0.978 

0.993 

0.997 

0.999 

1.000 

1.000 

1.000 

1.000 

3.975 

3.911 

2.308 

1815 

1.589 

1.394 

1.157 

1.045 

B(S») 

3.998 

1.919 

1.311 

asis 

0889 

a394 

ai57 

0.045 

E(SR) 

13.004 

11.068 

9830 

aio8 

7.234 

8868 

5.829 

5.179 

.  EjP) 

a796 

0883 

0.443 

0.383 

a314 

0.279 

0.331 

0.209 

1 

0.053 

0.000 

aooo 

0.000 

r  a 

0. 

0. 

0. 

3 

aii4 

0.000 

0.000 

aooo 

0.000 

0. 

0. 

0. 

3 

0.318 

0.001 

aooo 

0.000 

aooo 

0.000 

a 

0. 

4 

0.354 

0.007 

aooo 

0.000 

0.000 

0.000 

0. 

0. 

10 

S 

0815 

0.043 

0.000 

aooo 

0.000 

0.000 

aooo 

a 

6 

0.873 

0.174 

aoio 

0.000 

aooo 

0.000 

0.000 

0.000 

7 

asos 

0.441 

aii5 

0.013 

aooi 

0.000 

aooo 

0.000 

8 

0898 

0.737 

U477 

0.233 

0.073 

0.017 

0.000 

0.000 

9 

0.955 

0.933 

0.861 

0.767 

a645 

0.504 

0.338 

0.078 

10 

0.989 

0.997 

0.999 

1.000 

1.000 

1.000 

1.000 

1.000 

Ei 

5) 

5.584 

3831 

2.483 

3.003 

1.718 

1.520 

1.339 

1.078 

E(S*) 

4876 

3834 

1.483 

1.004 

0.718 

0.520 

0.339 

0.078 

E(SR) 

40.308 

38.535 

23.418 

18.778 

ia388 

14.887 

12.147 

ia705 

E(P) 

0.558 

0.333 

0.348 

0.300 

ai73 

0.152 

0.124 

0.108 

40 


Table  12:  Performance  characteristics  of  the  means  rule  R\  under  the  slippage 
configuration. 
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Table  13:  Performance  characteristics  of  the  medians  rule  Rj  under  the  equally 
spaced  configuration. 


a  780 

0.463 

0.188 

1  a  039 

0.003 

0.000 

1  0.000 

0.961 

0.901 

0.810 

0.673 

0.499 

aisi 

I'"'  ' *1 

0.999 

1.000 

1.000 

1.000 

1.000 

3.739 

3.363 

1.968 

1.701 

1.503 

1.181 

1.039 

1.731 

1363 

a968 

0.701 

asos 

aisi 

0.039 

8.677 

8.361 

4.778 

4.374 

4.003 

3363 

3078 

0.910 

0.787 

0.688 

3567 

0.801 

0394 

0346 

0389 

0300 

aooe 

aooo 

0.000 

0.000 

aooo 

0.000 

0.949 

0.694 

0.188 

aoii 

0.000 

0.000 

0.979 

0.918 

0.734 

0,363 

0.113 

0.019 

aooo 

0.000 

3993 

0.968 

0.967 

0.939 

0388 

0.743 

a403 

0.135 

0.996 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

4308 

3896 

3.884 

3333 

1.971 

1.763 

1.403 

1.138 

3310 

3396 

1.884 

1333 

0.971 

0.763 

0.403 

0.135 

14.687 

13379 

11.430 

9.888 

3771 

3038 

3611 

5.801 

0.779 

0.877 

0.464 

0.394 

0.383 

0.381 

0.338 

0.889 

0.991 

1.000 


3.413 

3.413 

39.730 

0.341 


0. 

a 

a 

0. 

0.000 

0.000 

0.000 

0.003 

0.683 

1.000 


1.684 

0.684 

13157 

0.168 


Table  15:  Comparison  of  the  rule  Ri  to  Rj-  Equally  spaced  configuration. 


PadChar. 


P{CS) 
B(S) 
S(S-) 
E(SR ) 
B(P) 


P*  m  Q.90,  ns! 


<V»»  i  J 


Ss/n  m  3.0 


P*  I  Ri/R* 


P*  m  0.00,  ns  5, 


|  f/i  s  1.S 

Sy/n  s  3.0 

Pad  Char. 

Pj 

Rl/Rl 

Pi 

P» 

«l /«a 

P(CS) 

0.997 

1.000 

0997 

1.000 

1.000 

1.000 

B(S) 

2.114 

3.371 

0627 

1.352 

2.000 

0.576 

E(S*) 

1.117 

2.371 

0471 

0352 

1.000 

0.352 

E(SR ) 

6.920 

9.268 

0747 

5.049 

0907 

0.731 

B(P) 

0628 

0.843 

0626 

0.338 

0500 

0876 

Pad  Char. 


P(CS) 
B(S) 
B(S*) 
E(SR ) 
B{P) 


Pm  s  ass,  ns  3, 


Sy/n  m  1.S 


Sy/nm  3.0 


P» 

P1/P2 

1.000 

1.000 

1.951 

0767 

0961 

0.522 

0784 

0.807 

0.488 

0766 

P*  s  0.96,  n»  5, 


Sy/n  s 

L.5 

Sy/nm  3.0 

PadChar. 

Pi 

Pj 

Pl/Pj 

Pi 

Pj 

Pi /Pa 

P{CS) 

0.999 

1.000 

0.999 

1.000 

1.000 

1.000 

E(S) 

2.400 

3.836 

0.626 

1.494 

2.396 

0.624 

E(S •) 

1.401 

2.836 

0494 

0.494 

1.396 

0.354 

E(SR) 

7.540 

9.821 

0.768 

5.467 

7.778 

0.703 

E(P) 

0600 

0.959 

0.626 

0.373 

0.599 

0.623 

t.l  4,11.1  tj  l.|  a  | 
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Table  16:  Comparison  of  the  rule  Ri  to  R2:  Slippage  configuration. 
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3  AN  ELIMINATION  TYPE  TWO-STAGE  PRO¬ 
CEDURE  FOR  SELECTING  THE  POPULA¬ 
TION  WITH  THE  LARGEST  MEAN  FROM 


k  LOGISTIC  POPULATIONS 

3.1  Introduction 

It  is  unrealistic  to  assume  that  we  always  have  k  populations  with  a  common  known 
variance.  When  the  variances  are  unknown,  it  is  not  possible  to  predetermine  the  sam¬ 
ple  size  for  a  single-stage  procedure  since  the  standard  errors  of  the  sample  means  are 
unknown.  (See,  for  example,  Dudewicz  (1971)).  Bechhofer,  Dunnett  and  Sobel  (1954) 
have  considered  a  two-stage  non-elimination  type  procedure  in  which  the  observations 
in  the  first  stage  are  only  used  to  obtain  an  estimate  of  the  common  unknown  vari¬ 
ance.  Gupta  and  Kim  (1984)  considered  an  elimination  type  two-stage  procedure  for 
the  case  of  common  unknown  variance  and  they  showed  that  their  procedure  performs 
much  better  than  the  non-elimination  type  procedure  of  Bechhofer,  Dunnett  and  Sobel 
(1954). 

For  selecting  the  population  having  the  largest  mean  from  normal  populations  with 
equal  known  variance  a 2,  Cohen  (1959),  A  lam  (1970)  and  Tamhane  and  Bechhofer 
(1977,  1979)  have  all  studied  two-stage  elimination  type  procedures,  in  which  they 
used  Gupta’s  (1956,  1965)  subset  selection  procedure  in  the  first  stage  to  screen  out 
non-contending  populations  and  Bechhofer ’s  (1954)  indifference  zone  approach  to  all 
populations  retained  in  the  second  stage. 

Tamhane  and  Bechhofer  (1977)  studied  in  depth  a  two-stage  elimination  type  pro¬ 
cedure  ('Pj)  for  selecting  the  largest  normal  mean  when  the  common  variance  is  known. 
In  order  to  determine  a  set  of  constants  necessary  to  implement  P, ,  they  proposed  a 
criterion  of  minimizing  the  maximum  over  the  entire  parameter  space  of  the  expected 
total  sample  size  required  by  P 3  subject  to  the  procedure’s  guaranteeing  a  specified 
probability  of  a  correct  selection.  As  a  consequence,  Pj  based  on  this  unrestricted 
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minimax  design  criterion  possesses  the  highly  desirable  property  that  the  expected  to¬ 
tal  sample  size  required  by  V\  is  always  less  than  or  equal  to  the  total  sample  size 
required  by  the  best  competing  single-stage  procedure  of  Bechhofer  (1954),  regardless 
of  the  true  configuration  of  the  population  means.  Due  to  the  difficulties  of  determin¬ 
ing  the  LFC  of  the  population  means  for  Jb  >  3,  and  of  evaluating  the  probability  of 
a  correct  selection  associated  with  V^  when  the  population  means  are  in  that  configu¬ 
ration,  they  adopted  a  lower  bound  to  the  probability  of  a  correct  selection  of  Vj  and 
obtained  a  set  of  constants  which  provides  a  conservative  solution  to  the  problem. 

In  this  chapter  we  consider  an  elimination  type  two-stage  procedure  for  selecting 
the  logistic  population  with  the  largest  population  mean  when  the  populations  have  a 
common  known  variance. 

We  propose  a  two-stage  elimination  type  procedure  Vi  and  a  non-linear  optimiza¬ 
tion  problem  by  using  a  minimax  criterion  to  find  a  set  of  constants  needed  to  imple¬ 
ment  Vi-  We  derive  lower  bounds  on  the  probability  of  a  correct  selection  and  the 
infimum  over  the  preference  zone  of  the  lower  bounds.  We  determine  the  supremum 
of  the  expected  total  sample  size  needed  for  Vi  over  the  whole  parameter  space.  We 
provide  tables  of  constants  to  implement  Vi  and  of  the  efficiency  of  Vi  relative  to  the 
single-stage  procedure  V\  considered  in  the  previous  chapter  for  the  two  special  cases 
of  the  equally  spaced  and  slippage  configurations. 

3.2  Preliminaries 

Let  ir,-,  x  =  1, . . . ,  Jb,  denote  Jb  logistic  populations  with  unknown  means  /*,  and  a  com¬ 
mon  known  variance  <r2,  and  let 

D  =  {/?  =  (fit,  •••,/**);  -oo  <m  <  oo,i  =  l,...,fc} 
be  the  parameter  space.  Denote  the  ranked  values  of  the  ^  by 

M(i]  <  •  •  *  < 


and  let 


6a  =  m  -  m- 


We  assume  that  the  experimenter  has  no  prior  knowledge  concerning  the  pairing  of  the 
Xi  with  the  M[j]t  <  =  1, . . . ,  k,  j  =  1, . . . ,  k.  Let  denote  the  population  associated 
with  ny\. 

The  goal  of  the  experimenter  is  to  select  the  ‘best’  population  which  is  defined  as 
the  population  with  the  largest  mean.  This  event  is  referred  to  as  a  correct  selection 
(CS).  The  experimenter  restricts  consideration  to  procedures  (‘ P )  which  guarantee  the 
basic  probability  requirement 

Pa[CS\V\  >  V/?€n(5),  (45) 

where  6  >  0  and  1/Jfc  <  P*  <  1  are  specified  prior  to  the  start  of  experimentation  and 

*  {/*  €  ft|  (MM  “  M[*-i])  >  5} 

which  is  defined  as  the  preference  zone  for  a  correct  selection. 

Here  we  propose  an  elimination  type  two-stage  procedure  'P2='P2(n1,  n2,  h)  which 
depends  on  non-negative  integers  m,  n2  and  a  real  constant  h  >  0  which  are  determined 
prior  to  the  start  of  experimentation.  The  constants  (nlt  n2,  h)  depend  on  Jfc,  6  and  P* 
and  they  are  chosen  so  that  Vt  guarantees  the  basic  probability  requirement  (45)  and 
possess  a  certain  minimax  property. 

Procedure  Vj\ 

Stage.  1:  Take  n2  independent  observations  1 

t  J  ™  1)  •  •  •  j  Sjj 

from  each  x<,  *  =  1, . . . ,  k,  and  compute  the  k  sample  means 

=  <-i . k. 

jail 

Let  Xjfcj*  =  \  Determine  the  subset  I  of  {1, .. .  ,fc}  where 

i = wx!"  >  4‘i’  -  wym), 


and  let  xj  denote  the  associated  subset  of  {x2, . . . ,  x*}. 


1.  If  t /  consists  of  one  population,  stop  sampling  and  assert  that  the  population 


associated  with  is  best. 


2.  If  *7  consists  of  more  than  one  population,  proceed  to  the  second  stage. 


Stage  2:  Take  additional  independent  observations  Xjj\  j  =  l,...,n3,  from 
each  population  in  *7,  and  compute  the  cumulative  sample  means 


*  - 


1 


nx  +  n  j 


for  *  €  I,  where 


>-1 


Assert  that  the  population  associated  with  max{Ci  X,  is  the  best. 


Remark  3.1  If  h  =  0  the  two-stage  procedure  Vi  reduces  to  the  single-stage  procedure 
Vi  which  was  considered  in  Section  2.S  with  single-stage  sample  size  n  =  nx  per  popu¬ 
lation.  Also  the  rule  determining  I  in  the  first  stage  is  of  the  type  of  the  subset  selection 
procedure  considered  in  Section  2-4 . 


There  is  an  infinite  number  of  combinations  of  (nx,n2,  h)  for  given  k,  6  and  Pm, 
which  will  exactly  guarantee  the  basic  probability  requirement  given  by  (45),  and 
different  design  criteria  lead  to  different  choices.  We  will  consider  one  of  these  criteria. 
Let  S'  denote  the  cardinality  of  the  set  I  in  stage  one  and  let 


5  -  p*  if5'  =  1 

(  S';  if  S' >  1. 

Then  the  total  sample  size  required  by  Vi,TSS  say,  is 


(46) 


TSS  —  kni  +  Sn3. 


Let  En[TSS\Vi]  denote  the  expected  total  sample  size  for  Vi  under  fi. 


We  adopt  the  following  unrestricted  minimax  criterion  to  make  a  choice  of  (ni ,  n2,  h) 
as  well  as  to  have  the  total,  sample  size  TSS  small.  For  given  k  and  specified  6  and 
P ",  choose  h)  to 

minimize  sup  E$TSS\Vi\ 

9€Q 

subject  to  'hrf^Pit[CS\'P2]>  P”,  (47) 

where  (n1?  n2)  are  non-negative  integers  and  h  >0. 

For  any  population  whose  sample  mean  has  the  MLR  property,  Bhandari  and 
Chaudhuri  (1987)  proved  that  the  least  favorable  configuration  (LFC)  of  the  two- 
stage  population  means  problem  is  a  slippage  configuration.  However,  the  problem  of 
evaluating  the  exact  probability  of  a  correct  selection  in  the  LFC  associated  with  V2 
is  complicated  and  still  remains  to  be  solved.  Here  we  will  consider  lower  bounds  for 
Pf[CS\V2\  and  construct  conservative  two-stage  procedures. 


3.3  Lower  bounds  for  the  probability  of  a  correct  selection 
for  V2 


In  this  section  we  derive  lower  bounds  for  Pn\CS\P^.  These  lower  bounds  will  prove 
to  be  particularly  useful  since  we  will  prove  that  they  achieve  their  infimum  over  fl(£) 
at  n(S)  which  has  components 

t*  =  P[i]  =  •  •  •  =  P[k_i]  =  fi[k]  -  6,  S  >  0. 

This  result  will  permit  us  to  construct  a  conservative  two-stage  procedure  which  guar¬ 
antees  the  basic  probability  requirement  (45). 

The  next  theorem  gives  one  of  these  lower  bounds  for  P${CS\V-i[. 

Theorem  3.1  For  any  p  €  H  we  have 


p>[cs\v,\ 

MO  til 

>  /  II*h(*  +  ^%y/n\l<r  +  h)dFni(x) 

J~°°  iml 

MO  ^ 

+  /  II  +  SkiVnl  +  nll°  )^n»+n,(x)  ~  1 


(48) 


where  F »(x)  is  the  cdf  of  the  standardized  sample  means  of  size  n  from  L(fi,<r3). 


Proof 

For  any  /I  €  ft  we  have 
Pg[CS\V2) 

=  -^[^(1)  ^  -  Wv^T ,  *  #  *,  X(*j  >  max,€iX(,-)] 

>  -  hafy/n^  ,XW  >  X(j),  V*  ^  fc] 

+Pjr(X(fc)  >  T(i),  V*  7*  fc]  -  1,  (49) 

since  P(AnB)  >  P(A) +P(B)  —  1  for  any  two  events  A  and  B.  Then  a  straightforward 
computation  leads  to  the  conclusion  of  this  theorem.  □ 

Corollary  3.1  For  all  p  £  0(6)  we  have 

^PACSW  >  (50) 

r  {a.(*+Mu '/»  +*)}‘-’^(x) 

J~oo 

+  /  +  &Vni  +  n2/<r  )}fc-1dPn,+B9(x)  —1.  (51) 

*-09 

Proof 

The  proof  follows  immediately  on  noting  that  the  right  hand  side  of  (48)  is  non- 
decreasing  in  each  Su  for  t  =  1, . . . ,  Jfe  -  1.  □ 

Remark  3.2  Since  the  right  hand  side  of  (51)  is  strictly  increasing  in  each  of  m, 
nj  +  r»a  and  h  and  tends  to  one  as  r»i  or,  nj  and  h  tend  to  oo,  we  see  that  the  basic 
probability  requirement  (45)  can  be  guaranteed  if  one  (or  more)  of  these  constants  is 
chosen  sufficiently  large. 

Remark  3.3  If  we  let  h  — ♦  oo  on  the  right  hand  side  of  (48)  we  obtain 

/to  fc-1 

II  ^»i+na(x  +  WShTnJAr  )dFni+n,(x) 

OO  ■  , 


«' .*4  ^ «JU\ 


which  is  an  expression  for  P$[C  S\V\\  where  V\  uses  a  common  single-stage  sample  size 
n  =  r»i  +  nj  per  population.  Thus  V\  is  a  special  case  of  Vi  based  on  a  conservative 
lower  bound  and  hence  Ep[TSS\Vi]  <  Jfcn  for  all  p  £  Cl. 

Remark  3.4  The  distribution  of  the  mean  of  samples  from  logistic  population  has  the 
monotone  likelihood  ratio  (MLR)  property  with  respect  to  the  location  parameter  (Goel 
(1975))  and  hence  the  distributions  of  the  X^  and  X^  are  stochastically  increasing 
(SI)  families  in  pi,  i  =  1, . . . ,  jfe. 

Remark  3.5  The  cumulative  sample  means 

X,  =  -22— X?>  +  -22— Xf 

nj  +  ni 

are  strictly  increasing  in  each  X^,  j  =  1,2,  *  =  1, . . . ,  k. 

We  can  now  find  another  lower  bound  to  the  Pp\CS\Vi\  given  in  the  following 
theorem  by  noting  the  facts  mentioned  in  Remark  3.4  and  Remark  3.5.  This  lower 
bound  can  be  shown  to  be  uniformly  superior  to  the  one  given  in  Theorem  3.1.  It  is 
also  straightforward  to  determine  the  LFC  of  the  population  means  relative  to  this  new 
lower  bound. 

Theorem  3.2  For  any  p  €  Cl  we  have 
inf  Pn[CS\Vi) 

<X€0  («)  1  ' 

>  f°{F„,(x  +  Vm/*  +/>)}*-'<(/'„,  (X) 

J—O o 

•  /  {Fni+n,(x  +  Sy/nx  +  ni/o  )}*_1dFni+na(x),  (52) 

J  —OO 

where  Fn(x)  is  the  cdfofthe  standardized  sample  mean  of  size  n  from  L(p,  a2). 

Proof 

Let  F(.|/ii)  and  G(.|/ij)  denote  the  cdf’s  of  the  X*1'  and  X,  respectively  and  let 
H(.,.\pi)  denote  the  joint  cdf  of  the  X^  and  X^.  Then  F(.|/ij),  C7(.|m<)  and  H(.,.\pi) 


52 


are  non-increasing  in  /i<,  t  =  1, . . . ,  i,  from  Remark  3.4  and  Remark  3.5.  Without  loss 
of  generality  we  may  assume  that  /*!<•••<  Hk-  Then  for  all  Ji  €  H(<5), 


Pf[CS \v2\ 

=  >  maxi<j<fc  —  ha/ y/fii,  ^(t)  —  maxi€l^(i)l 

>  x(*)  ^  X(i)»  Vi  =  1, . . . ,  *  -  1  ] 

=  r°  flH(x  +  ha/y/n^,y\ni)dH(x,y\iik) 

J-OO  j- <30 

J-OO  J-OO 

=  E„k  +  ha/y/nt,  X(*)|/i*  -  5}], 

where  the  expectation  is  with  respect  to  the  joint  distribution  of  and  ~%(k) 


PtlCSWl  >  ^{S)E^Hk'1(xw  +  WVJ*r,X(fc)|/i*  -  6}} 

and  it  is  enough  to  show  that  for  all  /t  € 


+ Wv/sr,7(t)|w  - «}] 

>  +  W./SIW  -  6))EM[Gl-'(Xm\^  -  «)]. 


By  Remark  3.5,  for  all  a,  b  and  /*, 

<  °.X«>  <  *>) 

=  pa* $  <  <>,x$  <  *%*■(>’  -  sstO 
=  KlPAX S  <  «.*$  <  “■*“(>  -  Ste^a)l^gH 
>  Er[PA^i)  <  «|X$} 

•a{*8>  <  **»(»-  ;fcx$)ix$}] 

=  p„{x$  <  «}p»{x,fl  < »} 


and  hence 


.  Hence 
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>  *„[?wI^i  +  WvS3«-<)] 
■£„[G‘-'{X(,I|W-«}J 

by  the  Chebyshev  inequality  (Hardy,  Littlewood  and  Polya  (1934)),  since 

fall!  +  Wv««  -  <} 

and 

G(X{k)\Pk  -  5} 

are  non-decreasing  in  □ 

Remark  3.6  If  we  let 

<*=  r°  {Fm(x  +  6y/n^/er  +  h)}k~ldFni(x) 

J—  OO 

and 

r°  {# »i+na(*  +  f>y/n\  +  nil  a  )}fc_1di:’ni+Wa(x)  , 
y— oo 

then  (51)  states  that 


inf  Pa[CS\V2]>a  +  b-l 
Jen (f)  J 


and  (52)  states  that 


inf  P?[CS\V2]  >  ab. 

By  noting  that  a  +  6  —  1  <  ab  for  all  a,  b  €  (0, 1),  the  lower  bound  (52)  is  uniformly 
superior  to  the  lower  bound  (51),  ami  hence  we  will  use  the  lower  bound  (52)  henceforth. 

3.4  Expected  total  sample  size  for  % 

In  order  to  solve  the  optimization  problem  (47)  we  first  find  an  analytical  expression  for 
the  Epi[TSS\V2]  and  then  determine  the  sup^€n  Ep[TSS \P2\  and  the  sets  of  ^-values 
at  which  this  supremum  occurs. 


Theorem  3.3  For  any  /»  €  U  we  have 

Ep[TSS\V2 1  =  fcni  +  n2  S  r  {  f[  ^  (*  +  Sijy/r^/a  +  h) 

- — «  * — oo 


>1  i»i 


-n^+wsr/®  -M}^w(*). 


(53) 


/-I 


where  Fn(x)  is  the  cdf  of  the  standardized  sample  means  of  size  n  from  L(fi,  cr2). 

Proof 

For  any  /?  €  fl  we  have 


£*[TSS|Pa]  =  km  +  n2Ep[S  |P3], 
where  5  is  defined  as  in  (46).  Now 

£*[S|P2]  =  ^[S'lPj]  -  PAS'  =  1|P2] 

■=  E-PjC^S  >  xg  -  wvsr .  Vi  #  i] 

-  £  >  Xg  +  *»/  VST ,  V;  #  .•]  (54) 

iml 

and  hence  Theorem  3.3  follows  immediately.  □ 

The  following  theorem  summarizes  the  result  concerning  the  supremum  of  the 
Efi[TSS\V2]  for  /Tefl. 

Theorem  3.4  For  any  ji  €  ft,  fixed  k  and  (ni,n3,  h)  we  have 

sup  Ep[T  S  S]P2] 
ten 

=  km+mT  ({F.l(i  +  5)}‘-'-(^(x-5)}‘-,)<iF.,(i)  (55) 

J— OO 

which  occurs  when  jj  =  •  •  •  =  where  Fn(x)  is  the  cdf  of  the  standardized  sample 
means  of  size  n  from  L(fi,<72). 

Proof 

Noting  Remark  3.4  and  Remark  3.5  we  can  use  the  results  of  Gupta  (1965)  which 
show  that  achieves  its  supremum  for  fi  €  fl  when  /i[i]  =  •  •  •  =  /*[*].  By  the 

similar  argument  PfflS1  =  \\V2]  achieves  its  infimum  when  /i[i]  =  •  •  •  =  /i^j.  Hence  the 
result  follows  immediately  from  Theorem  3.3.  □ 


3.5  Optimization  problem  yielding  conservative  solutions 


In  this  section  we  consider  the  optimization  problem  (47)  which  one  must  solve  in 
order  to  determine  the  constants  (nx,n2,  h)  which  are  necessary  to  implement  "P2.  As 
we  noted  earlier,  the  problem  of  evaluating  the  exact  probability  of  a  correct  selec¬ 
tion  in  the  LFC  associated  with  V2  is  very  complicated.  Thus  we  replace  the  exact 
inf*ren(j)-f/i[C by  the  conservative  lower  bound  given  by  the  right  hand  side  of 
(52),  and  consider  the  following  optimization  problem. 

For  the  given  k,  6  and  P*  choose  the  constants  (n j,  n2,  h)  to 

minimize  km  +  n2  H  [{Fn,  (x  +  h)}fc_1  -  {Fni  (x  -  h)}k~l]dFni  ( x ) 

«/— OP 

subject  to  r°  {F^x  +  6 y/m/ <7  +  h)}^1  dFni(x) 

J—oo 


where  nx  and  n2  are  non-negative  integers  and  h>  0. 

Let  us  denote  by  (m,h2,  h)  the  solution  to  the  optimization  problem  (56).  Then 
we  can  use  the  approximate  design  constants 

nx  =  [nx  +  1],  n2  =  [n2  +  1],  h  =  h , 

where  [z]  denotes  the  greatest  integer  which  is  less  than  z,  to  implement  V2. 

Table  17,  Table  18,  Table  19  and  Table  20  contain  the  constants  (n1?  n2,  h)  necessary 
to  approximate  (nlt  n2,  h)  and  the  values  of  the  expected  total  sample  size  (ETSS)  for 
*  =  2,3,4,5,10,15,  P*  =  0.75,0.90,0.95,0.99  and  5/<r  =  0.1, 0.5, 1.0, 2.0, 4.0.  All 
computations  were  carried  out  in  double-precision  arithmetic  on  a  Vax-1 1/780.  The 
SUMT  (Sequential  Unconstrained  Minimization  Techniques:  Fiacco  and  McCormick 
(1968))  algorithm  is  used  to  solve  the  non-linear  optimization  problem.  A  source 
program  in  Fortran  for  the  SUMT  algorithm  is  given  by  Kuester  and  Mize  (1973). 


3.6  The  performance  of  the  two-stage  procedure  relative  to 
the  single-stage  procedure 

As  a  measure  of  efficiency  of  the  two-stage  procedure  V2  relative  to  that  of  the  single- 
stage  procedure  V\  when  both  guarantee  the  same  basic  probability  requirement  (45), 
we  consider  the  ratio  termed  relative  efficiency  (RE)  Ep[TSS\V2]/kh,  where  h,  is 
the  estimate  of  the  minimum  sample  size  n,  needed  in  the  single-stage  procedure  V\. 
Clearly  RE  depends  on  /?,  6  and  P*.  Values  of  the  RE  less  than  unity  favor  V2  over 

Vi. 

Now  the  RE  is  given  by 

RE  =  TT-[*ni  +  E  { II  ^  (<  +  +  *) 

-T[F^t  +  6iiy/ri/a-h)}dF>.t(t))-  (57) 

where  n,  is  the  solution  of 

r  +  (^l*)6)}k-UFn.(t)  =  P-.  (58) 

«/— oo 

We  consider  the  relative  efficiency  for  two  special  cases,  namely,  the  equally  spaced 
and  the  slippage  configurations.  First,  for  the  equally  spaced  configuration,  we  assume 
that  the  unknown  means  of  the  k  populations  axe  fi,  fx  +  6,...,  n  +  (k  —  l)6  which  have 
ranks  1,  2, . . . ,  Jfe,  respectively.  Let  REeq  denote  the  relative  efficiency  with  respect  to 
the  above  configuration.  Then,  since  6ij  =  fi[{\  —  n\j]  =  (i  —  j)S, 

RE e,  =  TT-[fcnX  +  fl2  £  /  {  II  (*  +  >/*»(*  “  j)S/ a  +  h) 

fgxi  oo  jml 
>#• 

-  n  (59) 

>-> 

Next,  for  the  slippage  configuration,  we  assume  that  the  unknown  means  of  the  k 
populations  are  fxy]  =  n,  j  =  1, .  •  • ,  k  —  1,  and  fi[k\  =  (i  +  6,  S  >  0.  Then  the  relative 


.'  ^‘  ‘  >  ••■  «.«  >>4  »j  ■,m_uu;wt 


efficiency  with  respect  to  the  above  configuration,  RE^,,  is  given  by 


RE„  = 


rk**!  +  na{(i  -  1)  r  ( F i, (<  +  *)-  F»,  (<  -  A))*~* 

KTlg  «/— oo 

•(F«,(«  -  ^M/<r  +  i)  -  F»,(f  -  Vft7tf/<r  -  h))dF*,(t) 

+  /_JF»,(i  +  ^/<t  +  A)-F»,((+  VM/ff  -  htf-'dFi, (()}].  (60) 

Table  21  and  Table  22  give  the  values  of  the  RE^  and  RE„  for  given  values  of 
P *  =  0.75,0.90,0.95,0.99,  it  =  2,3,4,5,10,15  and  S/a  =  0.1, 0.5, 1.0, 2.0, 4.0. 

For  any  values  of  P*,  k  and  6 ,  REeq  ^  1  and  RE, p  <  1  and  hence  the  two-stage 
procedure  is  more  efficient  than  the  single-stage  procedure  in  terms  of  the  expected 
total  sample  sizes.  Furthermore,  the  effectiveness  of  Vi  appears  to  be  increasing  in  k 
since  the  values  of  REgg  and  RE ^  are  decreasing  in  k. 


S 


Table  17:  Constanta  to  implement  the  two-stage  procedure  Vi  for  selecting  the  largest 
logistic  population:  P*  =  0.75. 


|  P*  a  0.75  | 

k 

S/v  U  *1 

Aa 

k 

ETSS 

0.10 

0.45486+02 

0.45390+02 

0.55300+01 

0.1817200+03 

0.50 

0.26200+01 

0.89290+00 

0.73230+01 

0.7026480+01 

2 

1.00 

031216+00 

0.49830+00 

0.70976+01 

0.1620560+01 

2.00 

O9556a-01 

0.10706+00 

0.66960+01 

0.4051530+00 

4.00 

028340-01 

0.2232001 

0.61270+01 

0.1013000+00 

0.10 

0101801-03 

010440+03 

O.3688o+Ol 

0.6159380+03 

aso 

047780+01 

033400+01 

0.46160+01 

0.2433440+02 

3 

1.00 

089710+00 

010500+01 

0.73006+01 

0.5840850+01 

2.00 

018760+00 

027746+00 

O.66O60+OI 

0.139489e+01 

4.00 

08039e-01 

0.35860-01 

0.6112O+01 

0.3487380+00 

aio 

013920+03 

0.15150+03 

0.2751O+01 

0.1122410+04 

050 

0159000+01 

0.55540+01 

0.29470+01 

0.447121O+02 

4 

1.00 

017110+01 

0.10370+01 

0.41 59o+01 

0.1096390+02 

2.00 

032550+00 

0.32700+00 

0.5857O+01 

0.2608990+01 

4.00 

O.8O6I0-OI 

0.8252001 

0.57376+01 

0.6522520+00 

0.10 

016310+03 

02013o+03 

0.2278O+01 

0.1664850+04 

050 

067660+01 

0.76570+01 

0.23410+01 

0.666234O+02 

5 

1.00 

0.18260+01 

0.1630O+01 

0.27460+01 

0.1659610+02 

2.00 

038640+00 

041820+00 

0.40506+01 

0.3994140+01 

4.00 

096990-01 

010616+00 

0.40870+01 

0.9985386+00 

OlO 

023570+03 

043040+03 

0.14940+01 

0.4518240+04 

050 

096870+01 

0.17380+02 

0.14680+01 

0.181456O+03 

10 

1.00 

025040+01 

0.44550+01 

0.13960+01 

0.458878O+02 

2.00 

0.63670+00 

0.1178 0+01 

0.13616+01 

0.1175150+02 

4.00 

015820+00 

0.27580+00 

0.14700+01 

0.2952600+01 

0.10 

0.27140+03 

0.58550+03 

0.13690+01 

0.7448876+04 

050 

011000+02 

0.2372o+02 

0.13526+01 

0.2991970+03 

15 

1.00 

0.28580+01 

0.61190+01 

0.13060+01 

0.7573690+02 

2.00 

0.74660+00 

O.I6680+OI 

0.12556+01 

0.1959060+02 

4.00 

019060+00 

0.4032e+00 

0.13136+01 

0.5010040+01 

Table  18:  Constanta  to  implement  the  two-stage  procedure  Vi  for  selecting  the  largest 
logistic  population:  P*  —  0.90. 


P*  =  0.90 

6/9 

A 7 

k 

ETSS 

aio 

0.16680+03 

0.1738©+O3 

034460+01 

0.6501940+03 

050 

07013©+ 01 

0.6404©+01 

0.35910+01 

0.2597260+02 

1.00 

0.1933©+ 01 

013110+01 

033690+01 

0.6432010+01 

3.00 

04011  o+OO 

03734«+00 

0.53310+01 

0.1546200+01 

4.00 

0.1044©+ 00 

0.8907*01 

0.50260+01 

0.386564©+00 

aio 

O3745©+03 

035130+03 

0.20170+01 

0.146152O+04 

050 

01136©+ 03 

0.9634*+01 

0.20710+01 

0.5856650+02 

1.00 

0.3971 ©+01 

O3135*+Ol 

033320+01 

0.146860o+02 

3.00 

06894©+ 00 

051890+00 

0.50040+01 

0.3621970+01 

4.00 

01693©+00 

01310e+00 

0.49550+01 

0.900049O+00 

010 

0S398a+03 

0.33180+03 

0.17130+01 

0.3299400+04 

050 

01340©+03 

013000+03 

017380+01 

0.9229620+02 

1.00 

0.3489©+ 01 

03048*+01 

0.17960+01 

0.2329170+02 

3.00 

OS374*+00 

070080+00 

0.26430+01 

0.5926620+01 

4.00 

03090©+00 

017040+00 

0.28310+01 

0.1474620+01 

010 

0.3664©+ 03 

040340+03 

0.15560+01 

0.3150130+04 

050 

01488©+ 03 

015950+03 

0.15530+01 

0.1265420+03 

1.00 

0.3883©+ 01 

038580+01 

015590+01 

0.3201850+03 

3.00 

09610©+ 00 

093170+00 

0.1867o+01 

0.8399640+01 

4.00 

0.3403©+ 00 

0.31840+0 0 

0.20710+01 

0.2083300+01 

010 

0.4549©+03 

064650+03 

013670+01 

0.7501000+04 

050 

0.1 844*+ 03 

0.35880+03 

0.13570+01 

0.3016140+03 

1.00 

O47S4*+01 

0.64970+01 

0.13280+01 

0.7656620+02 

3.00 

0.1338©+ 01 

0.16440+01 

0.12570+01 

0.2013950+02 

4.00 

0.3335©+ 00 

0.43630+00 

0.13630+01 

0.5284810+01 

010 

0.4934©+ 03 

0.79110+03 

0.13660+01 

0.119540O+05 

050 

0.1999©+ 03 

0.3177*+O3 

0.13580+01 

0.4808220+03 

1.00 

05180©+01 

0.80330+01 

0.13350+01 

0.1221870+03 

3.00 

01433©+ 01 

0.30740+01 

0.12800+01 

0.3324600+02 

4.00 

03751«+00 

0.55930+00 

0.132 80+OI 

0.8652740+01 

iViJVUj  ««  M.I'1  o a ■> ->“»<«  i 
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Table  19:  Constants  to  implement  the  two-stage  procedure  Vi  for  selecting  the  largest 
logistic  population:  P*  =  0.95. 


P*  m  0.96 


Sfo 

ftl 

ft 2 

k 

ETSS 

0.10 

0.3008*403 

03827*4-03 

0.1781*401 

0.104953*404 

050 

01227*402 

0.1096*4-03 

0.1810*401 

0431347*402 

1.00 

0.32J5e+01 

0.3504*4-01 

01958*401 

0.106223*402 

2.00 

0.7831 *+00 

0.5883*4-00 

0.3556*401 

0.268333*401 

4.00 

01899*400 

0.1457*4-00 

0.3785*401 

0.667647*400 

aio 

0.4362*403 

03657*4-03 

0.1574*401 

0.211419*404 

050 

ai 788*4-02 

ai436*+02 

0.1589*401 

0.849314*403 

1.00 

a 45 79*4-01 

03388*4-01 

01654*401 

0.214801*402 

2.00 

ai223*4-01 

0.6952*4-00 

0.3369*401 

0.553339*401 

4.00 

02858*4-00 

0.1853*400 

0.3237*401 

0139794*401 

aio 

04901*4-03 

0.4519*4-03 

0.1452*401 

0318364*404 

aso 

02023*403 

0.1787*4-02 

0.1453*401 

0.127964*403 

1.00 

05232*4-01 

0.4335*4-01 

01464*401 

0.324315*402 

2.00 

01420*4-01 

0.9417*4-00 

01675*401 

0.846044*401 

4.00 

0.3383®+ 00 

0.3423*4-00 

0.2163*401 

0.218343*401 

aio 

05381 *+03 

05259*4-03 

01393*401 

0.426098*404 

050 

0.21 82  *+02 

0.2066*4-03 

0.1388*401 

0.171314*403 

1.00 

05849*4-01 

0.5112*4-01 

0.1379*401 

0.434710*403 

2.00 

01548*4-01 

01182*401 

0.1430*401 

0.114045*403 

4.00 

0.3809*+ 00 

0.3003*4-00 

0.1751*401 

0.399638*401 

aio 

0.8279*+ 03 

0.7683*+ 03 

0.1349*401 

0.973703*404 

aso 

02544*403 

0.3070*402 

0.1343*401 

0.391770*403 

1.00 

0.65 93 *+01 

0.7667*+ 01 

0.1331*401 

0.996400*403 

2.00 

01827*4-01 

0.1933*4-01 

0.1369*401 

0.263641*403 

4.00 

0.4897*4-00 

0.5216*4-00 

0.1344*401 

0.724963*401 

0.10 

0.6874*4-03 

0.9136*+03 

0.1377*401 

0.153152*405 

aso 

0.2703*+ 02 

0.3659*+ 03 

0.1370*401 

0.616396*403 

1.00 

0.7002*+ 01 

0.9178*+01 

0.1354*401 

0.156917*403 

2.00 

0.1942 *+01 

0.3339*401 

0.1310*401 

0.416523*402 

4.00 

0.5293*+ 00 

0.6784*400 

0.1300*401 

0.115109*402 

m 
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Table  20:  Constants  to  implement  the  two-stage  procedure  V2  for  selecting  the  largest 
logistic  population:  P *  =  0.99. 


j  P*  a  0.99  S 

* 

S/o 

*2 

h 

ETSS 

0.10 

0.6892*4-03 

0.5071*44)3 

01296*44)1 

0.202774*4-04 

aso 

0.2784*4-02 

0.2014*4-02 

0.1300*4-01 

0.815644*4-02 

2 

1.00 

0.7189*4-01 

0.4895*4-01 

01309*44)1 

0.207286*4-02 

3.00 

0.1966*4-01 

01107*4-01 

01414*4-01 

0.546768*4-01 

4.00 

0.5013*4-00 

0.2771*4-00 

0.2143*44)1 

0.148773*4-01 

aio 

0.8588*4-03 

0.5804*44)3 

01248*4-01 

0.366047*4-04 

0.50 

0.3475*4-02 

0.2299*44)2 

01249*44)1 

0.147249*4-03 

3 

1.00 

0.8954*4-01 

0.5614*4-01 

0.1254*4-01 

0.374304*4-02 

2.00 

0.2442*4-01 

0.1300*44)1 

0.1314*4-01 

0.988866*4-01 

4.00 

0. 861804-00 

0.3209*44)0 

0.2062*4-01 

0.278376*4-01 

0.10 

0.9268*4-03 

0.8663*4-03 

0.1264*4-01 

0.526885*4-04 

0.50 

03750*4-02 

02846*44)2 

01253*1-01 

0.211902*4-03 

4 

1.00 

09668*4-01 

0.6501*44)1 

0.1250*4-01 

0.539214*4-02 

2.00 

0.2847*44)1 

0.1542*44)1 

0.1266*44)1 

0142783*4-02 

4.00 

0.7350*4-00 

0.4850*4-00 

0.1230*4-01 

0.404965*4-01 

0.10 

0.963Se+03 

0.7432*4-03 

0.1271*4-01 

0.687580*4-04 

aso 

03903*4-02 

02965*4-02 

0.1266*44)1 

0.276700*4-03 

5 

1.00 

01008*44)2 

0.7283*4-01 

01258*4-01 

0.704195*4-02 

2.00 

0.2770*4-01 

01751*4-01 

0.1252*44)1 

0.186821*4-02 

4.00 

07688*4-00 

0.4714*4-00 

0.1451*4-01 

0.532902*4-01 

aio 

0.1049*4-04 

0.9971*4-03 

0.1343*4-01 

0.149575*4-05 

aso 

04248*44)2 

0.3976*4-02 

0.1340*4-01 

0.602284*4-03 

10 

1.00 

01099*4-02 

0.9879*4-01 

0.1327*4-01 

0.153541*4-03 

2.00 

03045*44)1 

0.2443*4-01 

0.1296*4-01 

0.409579*4-02 

4.00 

08789*4-00 

0.6787*44)0 

0.1394*4-01 

0.119728*4-02 

0.10 

01088*4-04 

0.1147*44)4 

0.1400*4-01 

0.231194*4-05 

0.50 

0.4406*4-02 

0.4583*44)2 

0.1396*4-01 

0.931205*4-03 

15 

1.00 

01140*4-02 

0.1143*4-02 

0.1384*4-01 

0.237593*4-03 

2.00 

0.3168*4-01 

0.2855*4-01 

0.1358*4-01 

0.635515*4-02 

4.00 

0.9277*4-00 

0.8022*4-00 

0.1455*4-01 

0.187967*4-02 

-.1 1 


II 


lVX»S>' 
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EquQjr  Spaced  Confifuralion 


S/o 


0.999 

a973 

a  836 

0.730 

0.SS1 

0.803 

1.000 

0.991 

0.887 

a  732 

0.886 

0.807 

1.000 

1.000 

0.941 

0.776 

0.873 

0.830 

a933 

0.938 

0.978 

0.782 

0.793 

0.834 

0.689 

0.898 

0.713 

0.643 

0.648 

0.663 

0.884 

ass9 

0.573 

asia 

0.833 

0.838 

Table  22:  Relative  efficiency  of  the  two-stage  procedure  ‘Pj:  Slippage  configuration. 


Slippage  Configuration 


0.999 

1.000 

1.000 

1.000 

0.996 

1.000 

0.999 

0.907 

0.930 

0.988 

0.999 

0.811 

0.839 

0.980 

0.518 

0^20 

0.531 

0.540 

0.457 

0.481 

0.474 

0.485 

0.922 

0.935 

0.975 

0.998 

0.796 

0.809 

0.853 

0.994 

0.698 

a7oe 

0.730 

0.854 

asso 

0.642 

0.858 

0.711 

0.537 

0JS32 

0.546 

0.590 

0.494 

0.499 

0.513 

0.551 

asoo 

0.826 

0.847 

0.985 

0.709 

0.715 

0.734 

0.818 

a«Bi 

0.858 

0.671 

0.733 

a«i« 

0.621 

0.638 

0.878 

0.845 

0.550 

0.584 

0.606 

0.517 

0.533 

0.535 

0.574 

0.739 

a  758 

0.695 

0.739 

0.871 

0.707 

0.653 

0.690 

0.603 

0.641 

0.577 

0.615 

W? 
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4  A  SINGLE-STAGE  RESTRICTED  SUBSET  SE¬ 


LECTION  PROCEDURE  FOR  SELECTING  THE 
POPULATION  WITH  THE  LARGEST  MEAN 
FROM  k  LOGISTIC  POPULATIONS 

4.1  Introduction 

In  the  subset  selection  formulation,  if  the  data  make  the  choice  of  the  best  population 
difficult  (we  would  expect  this  to  happen  if  the  fi,  are  all  very  close  to  one  another), 
we  are  likely  to  select  all  the  populations.  In  this  case  it  is  meaningful  to  put  on  an 
additional  restriction  that  the  size  of  the  selected  subset  will  not  exceed  m  (1  <  m  <  k). 

When  we  use  an  elimination  type  two-stage  selection  procedure  to  select  the  best 
population  and  we  have  only  limited  resources  to  use  for  the  secondary  exploration, 
we  also  need  more  flexible  procedures  which  allow  us  to  specify  an  upper  bound  m 
on  the  number  of  populations  included  in  the  selected  subset.  Any  selection  problem 
with  such  a  restriction  on  the  size  of  the  subset  is  naturally  called  a  restricted  subset 
selection  problem. 

Gupta  and  Santner  (1973)  studied  the  restricted  subset  selection  procedure  for  the 
normal  mmng  problem  in  terms  of  the  sample  means.  They  provided  the  tables  of 
the  required  sample  sizes  and  of  the  expected  number  of  selected  populations.  Sant¬ 
ner  (1975)  defined  a  general  restricted  subset  selection  procedure  in  terms  of  a  set 
of  consistent  estimators  for  the  parameters  whose  distributions  form  a  stochastically 
increasing  family  for  any  given  sample  size.  He  proved  that  the  infimum  of  the  prob¬ 
ability  of  a  correct  selection  occurred  at  a  point  in  the  preference  zone  for  which  the 
parameters  were  as  close  together  as  possible.  He  also  studied  some  properties  of  the 
rule  and  conditions  which  guaranteed  that  the  supremum  of  the  expected  number  of 
populations  selected  over  the  whole  parameter  space  occurred  at  some  point  where  the 
k  populations  were  all  the  same. 

In  this  chapter  we  consider  a  restricted  subset  selection  procedure  /?3,  based  on 


the  sample  means,  for  selecting  the  population  with  the  largest  mean  from  k  logistic 
populations  when  the  common  variance  is  known. 

Expressions  for  the  probability  of  a  correct  selection  for  any  configuration  of  the 
logistic  means  and  for  the  infimum  of  the  probability  of  a  correct  selection  over  the 
preference  zone  are  derived  and  some  properties  of  this  procedure  such  as  monotonicity 
and  consistency  are  studied. 

The  restricted  subset  selection  procedures  are  consistent  with  respect  to  the  pref¬ 
erence  zone.  However  the  infimum  of  the  probability  of  a  correct  selection  over  the 
preference  zone  can  not  become  arbitrarily  close  to  the  probability  level  P*  as  the 
constant  h,  which  defines  the  procedure,  becomes  infinitely  large  for  the  given  values 
of  k,  m,  6  and  n.  This  is  unlike  the  ‘usual’  subset  selection  procedures.  A  table  of  the 
bounds  of  the  infimum  of  the  probability  of  a  correct  selection  over  the  preference  zone 
is  provided  for  given  values  of  k,  m,  6  and  n. 

A  table  of  the  required  sample  sizes  for  the  restricted  subset  selection  procedure,  the 
sample  sizes  for  the  corresponding  fixed  subset  size  procedure  of  Desu  and  Sobel  (1968) 
and  the  ratio  of  the  above  two  sample  sizes  is  given  for  selected  values  of  P*,  k,m  and 
6.  The  expected  number  of  the  selected  populations  for  the  two  special  configurations, 
namely  the  equally  spaced  and  the  slippage  configurations,  are  considered. 

Instead  of  designing  the  rule  by  choosing  the  required  sample  sizes  for  arbitrarily 
given  values  of  h,  we  can  make  choice  of  the  rule  by  controlling  the  supremum  of  the 
expected  size  of  the  populations  selected  over  the  whole  parameter  space  as  well  as 
the  probability  level  P*  simultaneously.  Using  this  new  design  criterion  a  table  of  the 
design  constants  (n,  h)  for  the  restricted  subset  selection  rule  R3  is  provided. 

4.2  Formulation  of  the  problem 

Let  *■*,  i  =  1, . . . ,  Jk,  be  Jfc  logistic  populations  with  unknown  means  Hi  and  a  common 
known  variance  <72,  which  are  denoted  by  £(/jj,cr3).  Also  let 


/*(i]  <  •  •  •  <  P[k] 


be  the  ordered  means  and  X(q  the  population  with  mean  /*[,-],  the  best  population  being 
x(k)'  We  asstime  that  there  is  no  a  priori  knowledge  concerning  the  padring  of 
and  {*<}.  Let  8  >  0  and 

ft  =  {/T=  (/ij, •••,/!*); -oo  <m  <  oo,i  =  1,...,*} 

=  {$  €  ft  |  (/![*]  -  0[k-l])  >  £} 
ft°(5)  =  {fi  6  ft(£)|  H[i]  =  =  H[k]  ~  £}. 

Each  Xi  yields  it'd  observations  X{j,  j  =  1, . . . ,  n,  i  =  1, . . . ,  k,  which  are  also  inde¬ 
pendent  between  populations.  We  propose  the  following  rule  R3  based  on  the  means 
of  samples  of  size  n  from  the  fc  populations.  As  usual,  let  be  the  sample  mean  from 
x^  i  =  1, . . . ,  k,  and  let 

^[i]  <  *  *  •  <  ^[*] 

denote  the  ordered  sample  means. 

Rule  R3  :  Select  x,  iff 

"Xi  >  max{X[fc_m+1],3Tw  -  hzc/y/n},  h3  >  0.  (61) 

Goal  of  the  experimenter  :  Given  P”,  6  and  the  rule  R3  which  selects  a  subset  of 
the  populations  not  exceeding  m  in  size,  find  the  common  sample  size  n  necessary  to 
achieve 

Pff[CS\R3]  >P'Vp€  ft(6).  (62) 

The  evert  [C5|f?3]  occurs  if  and  only  if  the  selected  subset  contains  T(*j. 

Remark  4.1  Even  though  the  emphasis  in  this  chapter  is  on  the  case,  1  <  m  <  k, 
where  the  strict  inequality  8  >  0  insures  that  the  indifference  zone  does  not  vanish,  it 
should  be  noted  that  the  general  theory  formally  reduces  to  give  the  results  of  Section  2.8 
and  Section  2.4  for  the  choices  of  m  =  1  and  m  =  k  respectively  by  allowing  the  weaker 
condition  8  >  0. 


Remark  4.2  Ifh3  — ►  00,  R3  is  the  fixed  size  subset  rule  which  is  considered  in  Desu 
and  Sobel  (1968). 


4.3  Probability  of  a  correct  selection 


We  introduce  the  following  notation.  For  every  l  —  1, . . . ,  fe  and  for  every  t  =  fe¬ 
rn, fc  —  1,  let 

{3(0.; =i . (‘71)} 

denote  the  collection  of  all  subsets  of  size  i  from 


v 

* 

5 


& 


«(0  =  {1 . *}-« 

and 

3(0  =  «(0  -  3(0- 

Theorem  4.1  For  any  fi  €  ft,  we  have 
PitiCSlRs] 

*-i  (*“*) 

=  s  e  /  n  ^ +(/*[*]-  ) 

*=A-m  i=l  °°  l^S'j(k) 

'  n  + ^3 + (/<[*]  -  <*■ ) 

-^(t  +  (mm  -  Mw)V^/*  )}dFn(f),  (63) 

where  Fn(t)  is  the  cdf  of  the  standardized  mean  of  a  sample  of  size  n  from  L(m,<77). 

Proof 

Let  *(0  denote  the  sample  mean  from  the  population  *(,■).  Then, 

=  P*[X(fc)  >  max{X[*_m+i], Xm  -  hza/y/rx)] 

=  -Pr{^(*)  >  X<i)  —  h^o/y/n  for  l  <k  and 
X(k)  >  at  least  (fc  —  m)  X(/)S  with  1  ^  fe}. 

Now,  for  every  t  =  fe  —  m, . . . ,  fe  —  1  and  j  =  1, . . . ,  (fc”1)» 

A)  =  [Xw  >  X(j)  V  /  €  5j(fc)  and  X(*)  <  X(j)  V  /  €  3^(fe)]. 


JLS. 


Then 

P,ICS\R,] 

—  Pf  P(*)  —  "Xii)  —  h&l'/n  Vlct  and  Lfck-m  U>i  ^  A}] 

k  1  (fc_1) 

=  £  53  —  ^(0  “  h&lyfii  V  l  <  k  and  A)]. 

mi-m  j»l 

For  fixed  *  and  j, 

Pjj[7(*)  >  —  h3<r/ y/n  V/<Jb  and  Aj] 

=  P^pT(fc)  >  X(j)  V  /  €  Sj(&)  and  7(*)  <  7(j)  <  7(*)  +  A3<r/ y/n  W  € 

=  f  II 

°°  /€Sj(k) 

•  II  W* + ^3 + (mw  -  m) 'ftla ) 

»e3 >) 

-Pn(*  +  -  HV\)y/n/(T  )}dFn(t).  □ 

Remark  4.3  An  application  of  the  dominated  convergence  theorem  shows  that 

P*[CS|Rs]  -*  1  <M  (/*[*]  -  fi[k-i])  -*  oo.  (64) 

Next  we  determine  the  infimnm  over  Cl{6)  of  the  probability  of  a  correct  selection 
in  the  following  theorem. 

Theorem  4.2  For  any  p  €  fi(6),  toe  have 

inf  Pj[C5|i?3]  =  inf*r€no(5)  Pf[CS |iZ3] 

i*€0  (s) 

=  e  (*_1)  r  w+w»))‘ 

imk—m  \  *  /  —0° 

*{Pn(t  +  h3  +  6y/n/ <T  )  —  F n(t  +  Sy/n/<7  )}  dF n(t) 

=  J  {Fn(t+ h3  + 6y/n/a  )}*_1 


(65) 


toAerv  /{y;a,b)  -  -  w)b~ldw  denote  the  incomplete  beta  function  with 

parameters  a  and  b. 
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We  use  the  following  lemma  due  to  Alam  and  Rizvi  (1966)  and  also  due  to  Ma- 
hamunulu  (1966). 

Lemma  4.1  Let  X  =  (Xi, . . . ,  X*)  have  k  >  1  independent  components  such  that  for 
every  i,  X{  has  cdf  H{x{\ Of).  Suppose  that  {H(x\6)}  form  a  stochastically  increasing 
family.  If 'll (X.)  is  a  monotone  function  of  Xi  when  all  other  components  ofX  are  held 
fixed,  then  £j{^(X)]  is  monotone  in  9i  in  the  same  direction. 


Now,  let 


*(X)  =  <  1;  d  *(k)  - 
0;  otherwise. 


-m+ll,^[fcj  ~  h3<T/y/n) 


We  claim  \t(X)  is  non-increasing  in  for  t  =  1, . . . ,  k  —  1.  Let 

X(<)  <  T(i), 

X  =  (X(l),...,X/fc)) 


x'  =  (^(i),  •  •  • ,  3*  (o»  X(i+1), . . . ,  X(i)). 


Then 


max{X[k_m+xj,  -  hza/y/n)  <  rnax{T[*_m+1],T(jt]  -  h3a/y/n) 
where  the  primes  denote  the  order  statistics  from  X'.  So  if  ®(X)  =  0  then  'J'(X')  =  0. 


Hence 


Pit[csm  =  Enmx)) 


is  non- increasing  in  each  of  /i[i]».  •  i]  when  all  other  means  are  fixed.  So 

inf  iyCS|i?3]=  inf  Pp[CS\Rz\ 

iT€n(f)  1  pen °(5) 

and  hence  substituting  the  vector  of  means  (^j, . . . ,  ^[i],  p[i]  +  5)  gives  the  result.  □ 


4.4  Properties  of  R$ 

We  consider  next  the  properties  of  the  restricted  subset  selection  rule  R3  based  on  the 
sample  means.  To  facilitate  this  study  we  let 


Pit(i\R)  =  Pf{rult  R  selects  X(,-)}  (66) 

and  recall  the  following  definitions. 

Definition  4.1  R  is  a  monotone  procedure  means  that  for  all  p  £  Cl  and  i  <  j, 

Pf(i\R)  <  W- 


Definition  4.2  R  is  an  unbiased  procedure  means  that  for  all  p  £  Cl  and  j  <  k, 

Pj*[R  does  not  select  xy)]  >  Pp[R  does  not  select  X(fcj]. 

Of  course,  R  is  monotone  implies  that  R  is  unbiased.  Other  optimal  properties  axe 
Definition  4.3  R  is  consistent  with  respect  to  O'  means  that 


lim  inf  P*[(7S|P]  =  1. 

fcCl' 


Definition  4.4  R  is  strongly  monotone  in  X(,-)  means  that 


P*W) 


!t  in  p[ j]  when  all  other  components  of  p  are  fixed 

i  in  ftyi  when  all  other  components  of  p  are  fixed  (j  i). 


Theorem  4.3  For  every  i  =  1,. . .  ,k,  R3  is  strongly  monotone  in  X(j). 


Proof 

We  have  already  shown  this  result  for  i  =  k.  Since  for  i  <  k  we  have 


p,m,)  =  es  MX)), 


where 


«7(X)  = 


1;  if  X,o  >  max{X[*_m+i),  X[*j  -  h3<r/ v'n} 
0;  if  otherwise, 
the  same  argument  applies  to  give  the  desired  conclusion.  □ 


1 

I 

I 


I 
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Corollary  4.1  All  rules  of  the  form  (61)  are  monotone  and  unbiased. 

Proof 

The  proof  follows  from  the  definition  of  monotonicity  and  the  property  of  being 
strongly  monotone  in  *■(,•)  for  all  *.  □ 

Theorem  4.4  R3  is  consistent  with  respect  to  f2(<5). 

Proof 

We  must  show  that 

rE  (*  7 l)  + 

■{F.(t  +  h,  +  SyJ n/<r)  -  F,(t  +  Sy/K/<T)}k-'-‘dF^t)  =  1.  (67) 

We  note  that  each  integrand  is  bounded  with  respect  to  the  measure  Fn  and  so  the 
dominated  convergence  theorem  applies.  For  every  i  <  (k  —  1)  we  have 

lim  {Fn(t  +  6y/n/v)¥{Fn{t  +  h3  +  byfrfa)  -  Fn(t  +  =  0 

A  lOO 

and  for  *  =  k  —  1 

+  - 1. 

™  1  OO 

Hence  the  result  follows.  □ 

This  theorem  says  that  no  matter  what  probability  level  is  required  for  a  correct 
selection  it  can  be  met  by  choosing  a  sufficiently  large  sample,  for  any  given  k,  m  and 
6. 

Theorem  4.5  For  every  n  and  rule  R3, 

lim  inf  Pa[C75|fk]  =  1. 

For  every  n,  m  <  k  and  6  >  0, 

lim  inf  P»[C5|i23] 

-  (*  -  m)  (  *  jT  { 1  -  F.(t  -  iyfile)) 

•{F.(()}‘ - ,{l-F„(l)}”’-VFn(() 

=  /  I{Fn(t  +  6Vn/<r)’,k-  m,m)dFn(t).  (68) 

J— OO 
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Both  results  follow  from  the  dominated  convergence  theorem.  The  second  result 
follows  the  same  theorem  and 

J5a.Jtt»Jw°,awd 

=  X  (k~1)r  {K(t  +  6^/c)Y 

sstik— m  >  / 

•{1  -  Fn(t  +  Syfrlatf-'-'dF n(t) 

-  £>-  *)  C:i)  Cpjl* + «*/.),  s”"{1  -  »>*-■**«•« 

-  (‘-")(*‘:i)£jCU/,  P-F.W)-' 

letting  to  as  F“1(l  —  y)  and  changing  the  order  of  integration.  □ 

Remark  4.4  The  first  part  states  that  by  taking  6  sufficiently  large  we  can  attain  any 
P*  probability  requirement  for  the  rule  iZ3  based  on  any  number  of  observations.  The 
second  result  says  that  given  a  fixed  6  >  0  and  a  common  sample  size  n,  we  cannot 
achieve  all  P*  values.  We  can  only  attain 

p-  <  (*-™)(*r^)/_ji-F.(<-«v^)} 

•{F.(i)}‘ - ‘{1  -  F.(()}"-l<<F.(l) 

«  r°  I{Fn(t  +  Sy/n/a);  k-m,  m}dFn(t)  <1.  (69) 

J_oo 

Remark  4.5  Using  the  monotonicity  of'wfit€a(6)Pa[CS\R3 ]  toe  can  obtain  the  follow¬ 
ing  bounds.  For  m  <  k  and  6  >  0, 

r{Fm(t  +  6y/n/<r)}h-ldFm(t) 

J—  oo 

<  inf  P*[C5|f?3] 

a€0(«)  1 

<  f  I{Fn(t  +  Sy/n/a);  k  —  m,  m}dF„(t). 

J—  OO 


(70) 


Table  23  contains  the  above  lower  and  upper  bounds  of  the  infimum  of  the  proba¬ 
bility  of  a  correct  selection  over  Q(6)  for  k  =  3, 5, 10,  m  =  2, 4, 5  <  ife,  8/a  =  0.5, 1.0, 2.0 
and  rt  =  5, 15.  All  computations  were  carried  out  in  double-precision  arithmetic  on  a 
Vax-1 1/780. 

For  the  purpose  of  implementing  the  procedure  R3  and  comparing  R3  to  the  fixed 
size  subset  rule,  we  have  prepared  Table  24  and  Table  25.  For  Pm  =  0.90,  k  =  5, 10, 
m  =  2, 3, 4, 5  <  k ,  6/<r  =  0.5, 1.0, 2.0  and  h3  =  0.4, 0.7, 1.3, 1.6,  the  tables  give  the 
values  of  the  minimum  sample  size  (n(/ts))  which  satisfies 

£>(‘  +  *3  +  *  -  m.mMF.W  >  r. 

the  values  of  the  minimum  sample  size  (n(oo))  for  the  fixed  size  subset  rule,  which 


satisfies 


r°  I{Fn(t  +  Sy/n/<r);  k  -  m,  m}dFn(t)  >  Pm 

J— OO 


and  the  ratio  (n(/i3)/n(oo))  of  the  sample  size  for  the  restricted  subset  selection  rule 
R3  to  the  sample  size  for  the  fixed  size  subset  rule  when  both  rules  attain  the  same 
probability  requirements.  For  large  h3  values  this  ratio  is  close  to  one,  indicating  that 
in  many  cases  a  slight  additional  cost  will  allow  the  use  of  a  restricted  subset  selection 
procedure  which  meets  the  same  probability  requirement. 

The  expected  number  of  selected  populations  depends,  of  course,  on  the  underlying 
/?.  Some  exact  comparisons  for  the  equally  spaced  and  slippage  configurations  will  be 
considered  in  the  next  section. 

4.5  Expected  number  of  selected  populations 


As  usual,  we  define 


^  f  i;  if  *(0  ^  E 

0;  otherwise. 


>  rnax{X[fc_m+1j,X[fcj  -  h3(r/y/n} 


This  gives  5,  the  number  of  populations  selected,  as 

s  =  X> 


.vVv*/.v: 


^  Wo  co  o 


Then  the  expected  number  of  populations  selected  by  R3  is  given  by 

*«Wy»EWW*). 

where  Pj(*|f23)  is  defined  by  (66). 

Theorem  4.6  For  any  p  €  fl,  we  have 

k  k-l  (***) 

*  E  E  £  J  n  ^»(*  +  (/*[.]  -  MM)\/n/<r) 

iml  J*xk-m  jml  l€S'(0 

•  II  {?*(? + ^3  +  (mw  -  m) >/»/*) 

/e3J(0 

-P„(<  +  (/*[•!  -  /in)v/n/<r)}dPB(t),  (71) 

where  F„(t)  ia  the  cdf  of  the  standardized  mean  of  samples  of  size  n  from  L(/i<,< ra). 
Proof 

Prom  the  above  discussion,  we  see  that  it  suffices  to  calculate  Pp(i\R3)  for  t  =  l,...,  I 
Using  arguments  similar  to  those  in  the  proof  of  Theorem  4.1,  we  get 

k  1  (*“*) 

fjtW  -  E  t,  P' Wo  £  X,I)  V  /  6  Sf(i) 

pmh—m  /al 

and  X(,-)  <  X(j)  <  X(i)  +  h3<r/ y/n  V  /  6  ^(t)} 

*-1  (V)  ,00 

=  £  £  /  II  F"(*  +  0*m  -  m)^!a) 

pcmk-m  jml  J~°°  l€S?({) 

•  n  {Pn(t  +  h3  +  -  turfy/n/a) 

^(0 

-P„(t  +  (/xw  -  Mfl)v/"M}<tf’n(f)-  D 

Since  2?/r[5|i23]  is  increasing  in  h3  the  experimenter  may  seek  to  use  rules  with 
small  h3.  On  the  other  hand,  for  fixed  6  and  P*,  the  smaller  h3  is,  the  larger  n  must 
be  to  achieve  the  required  probability  condition  (62).  Hence,  the  experimenter  must 
decide  what  trade  off  between  n,  h3  and  6  he  is  willing  to  accept.  To  investigate  the 
interdependence  in  more  detail,  we  have  tabulated  in  Table  26  and  Table  27  the  values 


.*  v.y .vj «v ■'././.■v.y.v.  *\ 

/».%  *•  ,n j .'•/.S' ■  ^ . 


•  E(S)  =  EAS\R& 


•  E(SR)  =  Ylimi  *-P^(t|i?3);  the  expected  sum  of  ranks  of  the  selected  populations 

and 

•  E(S)/m;  the  expected  proportion  of  selected  populations 
under 

1.  Equally  spaced  means  p  =  (a,  a  +  6, . . . ,  a  +  (ife  —  1)6)  and 

2.  Slippage  means  p  =  (a,  a, . . . ,  a,  a  +  6), 
for 

(*,  m)  =  (4, 2),  (5, 3),  n  =  2, 3, 4, 5, 10, 15,  h3  =  0.4, 0.7  and  6/a  =  0.1, 0.5, 1.0, 2.0.  All 
computations  were  carried  out  in  double-precision  arithmetic  on  a  Vax-1 1/780. 

4.6  Supremum  of  the  expected  number  of  selected  popula¬ 
tions  and  a  new  design  criterion  for  R$ 

Santner  (1975)  considered  a  general  restricted  subset  selection  procedure  in  terms  of 
consistent  estimators  for  the  population  parameters  whose  distributions  form  a  stochas¬ 
tically  increasing  family  for  each  given  sample  size.  In  particular  he  gave  conditions 
which  guarantee  that  the  supremum  of  the  expected  number  of  populations  selected 
over  the  whole  parameter  space  occurs  at  some  point  where  the  k  population  parame¬ 
ters  are  all  the  same. 

We  can  consider  the  means  rule  R3  as  a  special  case  of  a  location  parameter  problem 
using  Santner’s  general  procedure.  By  noting  that  the  distribution  of  the  mean  of 
samples  from  a  logistic  population  has  the  MLR  property  with  respect  to  the  location 
parameter  and  hence  forms  a  SI  family,  we  can  see  that 

Theorem  4.7  For  every  p  6  fl,  we  have 

sup£*[S|R3] 


In  Section  4.4  we  determined  the  needed  sample  size  n  for  the  rule  f?3  for  the  arbitrarily 
chosen  values  of  h3,  k,  m  and  6  because  we  could  not  determine  the  values  of  n  and 
hz  at  the  same  time  by  controlling  the  basic  probability  requirement  only.  Since  we 
desire  to  select  smaller  S ,  it  is  reasonable  to  make  a  choice  of  (n,  h3)  by  controlling  the 
sup^jj  Eft  [5 1 J23]  as  well  as  inf P#  [C5|  P3] . 

Using  Theorem  4.2  and  Theorem  4.7,  we  can  choose  a  new  set  of  design  constants 
(n,  h3)  to  implement  R3  by  solving  the  following  equations  simultaneously, 

r  {Fnit+hz  +  Sy/Z/*)}11-1 

J— oo 

-  p'  <73> 

+  “  1  +  e  (74) 

for  the  given  values  of  P*, k,  m,  6  and  small  c  >  0. 

A 

Table  28  and  Table  29  contain  the  estimates  (n,  h3)  for  the  constants  (n,  h3), 
which  satisfy  (73)  and  (74)  simultaneously  for  given  values  of  P*  =  0.90, 0.975,  k  = 
3,4,5,10,15,  m  =  2, 3, 4, 5,  6/a  =  0.5, 1.0, 2.0  and  e  =  0.01.  All  computations  were  car¬ 
ried  out  in  single-precision  arithmetic  on  a  CDC-6500.  The  IMSL  subroutine  ZSCNT 
was  used  to  solve  the  above  system  of  non-linear  equations  and  the  f-norm  in  the  tables 
indicates  the  accuracy  of  the  computation,  which  is  defined  in  the  ZSCNT. 


Table  24:  The  minimum  sample  sizes  needed  for  rule  R3  and  the  corresponding  fixed 
size  subset  rule:  P *  =  0.90,  k  =  5. 


P*  s  0.80,  k  : 


0.70 

0.50 

1.00 

2.00 

0.50 

1.30 

1.00 

2.00 

0.50 

1.60 

1.00 

2.00 

n(As)/n(oo) 

1.683 

1.500 
2.000 
1.463 
1.250 
2.000 
1.385 
1.350 
2.000 
1.385 
1.250 
2.000 
3.333 

2.500 
3.000 

2.667 
2.000 
1.000 
1.833 

1.500 
1.000 

1.667 

1.500 
1.000 
10.000 


.«  ij>  «i 
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Table  28:  Estimates  of  constants  for  rule  R3:  P *  =  0.90,  Sup(S)  =  1.01. 


P*  m  0.900, 

~JL'  T 


,  StipjS)  a 
ft 

,1965*+02 

.4883*441 

■11880+01 

.23830+02 

.59460+01 


hj  /  —  norm 
.11?9»41  .6693o-15 

.1166*41  .46480-22 

.1112041  .89250-15 

.9717o42  .20490-17 

.9648*42  .2318o-18 


■14760+01  .93840-02  .12030-13 

.2383»+02  .9667*42  .8419*20 

■5946«+01  ,9598o42  .3176*19 

.14760+01  .93340-02  .3359*15 


.26860+02 

.67150+01 

.1680e+01 


.86130-02  .9771*18 

.8580o42  .2066*13 

.8439e-02  .1658*17 


.26860+02  .8563*02  .4192*16 
.67150+01  .8528*02  .3007*17 


■1680O+01 

.26860+02 


■83860-02  .5923*18 

.8562*02  .4433*16 


.67150+01  .8528*02  .2910*17 
.16800+01  .8386*02  .5918*18 


.35500+02 

.89220+01 

.22730+01 

.35500+02 

.89220+01 


.6535*02  .1496*15 

.6569*02  .4289*13 

■6673*02  .2864*14 

.6488*02  .2623*15 

.6521*02  .2750*16 


■22730+01  .6623o42  .7496*14 
.35500+02  .6488042  .2639*15 


.6521e-02  .2893*16 

.66220-02  .7455*14 

.6488042  .2659*15 

.65210-02  .2891*16 

■6622e-02  .7448*14 

.58030-02  .2848*15 

.58610-02  .1097*16 

■6062*02  .4032*15 

.57580-02  .3480*14 


.89220+01 

■22730+01 

.35600+02 

.89220+01 

.22730+01 

.40240+02 

.10140+02 

.26020+01 

.40240+02 


.10140+02  .58160-02  .2559*15 


■2602 e+ 01 
.40240+02 


.6013*02  .8833*16 

.5758*42  .1364*13 


.10140+02  .5816*42  .2682*15 

.26020+01  .6012*02  .8877*16 

.40240+02  .5758*02  .1381*13 

.10140+02  .5816*02  .2684*15 

.26020+01  .6012*02  .8877*16 


Table  29:  Estimates  of  constants  for  rule  R3:  P*  =  0.975,  Sup(S)  =  1.01 


p*  =  0.975, 

s/°  11 


SMS)  ■ 

a 


/  —  norm 


.389*0+03  .1183«-01  .13430-17 

.97940+01  .11750-01  .60860-19 

.35040+01  .1150*01  .30010-31 


.97380-03  .31950-31 

.9691003  .14350-31 


.4399o+03 

.11080+03 


■38410+01  .9650003  .3667o-16 

.43990+03  .9677003  ,1730o-17 

.11080+03  .9640003  .17340-17 

.38410+01  .9499003  .35 98*18 

.47590+03  .8618003  .70530-16 

.11990+03  .8600003  .1076*15 

.30830+01  ,8535003  .3339o-16 

.47590+03  .8567*03  .3190*18 

.11990+03  .8549003  .1444*17 

.30830+01  .8473*03  .3198*17 

.4759o+03  .8567*03  .3819*33 


.8549*03  .1444*17 

.8473*03  .3198*17 


.11990+03 

■30830+01 

.57710+03 

.14560+03 

.37660+01 


.8548003  .7657*33 

■8473*03  .3403*33 

.6530*03  .9663*16 

.6553*03  .1135*15 


.6633*03 


.57730+03  .6483*03 


■1408*15 

.3547*17 


.14570+03  .6504*03  .1364*17 

■37660+01  .6574*03  .1773*16 

.57730+03  .6483*03  .3457*33 

.14570+03  .6504*03  .1353*33 

■37660+01  .6574*03  .3067*33 

.5773*+03  .6483*03  .6563*37 

.14570+03  .6504*03  .3534*37 

■37660+01  .6574*03  .4089*36 

.63300+03  .5795*03  .3436*14 

.15960+03  .5833*03  .3331*14 


■41390+01  .5966*03  .3695*14 

.63310+03  .5751*03  .8806*17 

.1596*+03  .5788*03  .1371*17 


■41390+01 

.63310+03 


.5919*03  .8336*17 

.5751*03  .1336*31 


.1596O+03  .5788*03  .7943*33 


■41390+01 

.63310+03 


■5918*03  .1707*31 

.5751*03  .3534*37 


.15960+03  .5788*03  .9314*37 
.41390+01  .5918*03  .5364*36 


5  AN  ELIMINATION  TYPE  TWO-STAGE  PRO¬ 
CEDURE  USING  RESTRICTED  SUBSET  SE¬ 
LECTION  RULE  IN  ITS  FIRST  STAGE  FOR 


SELECTING  THE  BEST  POPULATION 

5.1  Introduction 

Tamhane  and  Bechhofier  (1977,  1979)  studied  a  two-stage  elimination  type  procedure 
for  selecting  the  largest  normal  mean  and  we  considered  in  Chapter  3  an  elimination 
type  two-stage  procedure  Vi  for  selecting  the  largest  among  several  logistic  populations. 
It  is  well  known  that  the  above  two-stage  procedures  Me  quite  efficient  relative  to  the 
corresponding  single-stage  procedures  in  terms  of  the  required  sample  sizes.  However, 
sometimes  we  may  have  only  limited  resources  to  use  in  the  second  stage.  In  those 
cases  we  need  more  flexible  procedures  which  allow  us  to  specify  an  upper  bound  m  on 
the  number  of  populations  included  in  the  selected  subset  in  the  first  stage.  Gupta  and 
Santner  (1973)  studied  the  selection  problem  with  such  a  restriction,  which  is  called 
a  restricted  subset  selection  procedure,  for  selecting  the  largest  normal  mean  and  we 
considered  restricted  subset  selection  procedures  for  selecting  the  largest  logistic  mean 
in  Chapter  4  in  the  framework  of  single-stage  procedures. 

Here  we  propose  an  elimination  type  two-stage  procedure  V'2  for  selecting  a  popula¬ 
tion  with  the  ‘largest’  real  parameter,  in  which  a  generalized  restricted  subset  selection 
procedure  (Santner  (1973,  1975))  is  used  in  the  first  stage  in  terms  of  a  set  of  consis¬ 
tent  estimators  for  the  population  parameters  whose  distributions  form  a  stochastically 
increasing  family  for  a  given  sample  size.  We  also  propose  an  optimization  problem 
using  a  minimax  criterion  to  find  a  set  of  constants  needed  to  implement  'P,. 

We  derive  a  lower  bound  of  the  probability  of  a  correct  selection  and  a  formula  for 
the  infimum  of  the  lower  bound  over  the  preference  zone. 

We  derive  an  analytical  expression  for  the  expected  total  sample  size  and  study 
conditions  guaranteeing  that  the  supremum  over  the  whole  parameter  space  of  the 


expected  total  sample  size  occurs  at  some  point  where  all  of  the  parameters  are  equal. 
We  also  derive  a  general  expression  for  the  supremum  over  the  whole  parameter  space 
of  the  expected  total  sample  size  under  these  conditions. 

A  non-linear  optimization  problem  which  one  must  solve  in  order  to  determine  the 
constants  needed  to  implement  V2  for  location  and  scale  parameter  problems  and  a 
relative  efficiency  of  with  respect  to  the  corresponding  single-stage  procedure  are 
defined. 

We  apply  V2  to  the  location  parameter  problem  of  univariate  normal  populations. 
Here  we  provide  tables  of  constants  to  implement  V2  and  of  the  relative  efficiency  for 
each  case  of  the  equally  spaced  and  slippage  configurations. 

5.2  Preliminaries 

Let  X,-,  *  =  1, . . . ,  jfc,  be  k  populations  which  are  characterized  by  unknown  scalars 
Xi  €  A,  where  A  is  a  known  interval  on  the  real  line.  Let  <  •  •  •  <  A^j  be  the 
ordered  A,’s, 

fi  =  {A  =  (Ai, •  •  • , A*)|A,-  €  A  Vt  } 

the  space  of  all  possible  underlying  configurations  of  A,’s  and  X(,-)  the  (unknown)  pop¬ 
ulation  with  parameter  Aj,-j.  It  is  assumed  that  there  is  no  a  priori  knowledge  of  the 
correct  pairing  of  the  elements  in  {x,}  and  {x(,-)}.  The  goal  is  to  define  a  two-stage 
procedure  V2  to  select  the  ‘best’  population  where  for  sake  of  definiteness  X(fc)  is  taken 
to  be  the  best  population.  In  some  cases  X(i)  might  be  the  best  population.  Of  course, 
if  t  (2  <  t  <  k)  populations  all  have  A<  =  A[*j,  the  selection  of  any  of  these  tied 
populations  accomplishes  the  goal. 

Each  x,  yields  iid  observations  Xij ,  j  =  1, . . .  ,n,  which  are  also  independent  be¬ 
tween  populations.  has  cdf  Fi  with  parameter  A,-.  Furthermore  it  is  assumed  that 
there  exists  a  sequence  of  Borel  measurable  functions  {Tn}  so  that  Tn  is  defined  on  n 
dimensional  sample  space  and 


The  assumptions  concerning T,n  are  that  its  cdf  Gn(y|A<)  with  support  E„'  is  absolutely 
continuous  with  respect  to  Lebesgue  measure  with  pdf  y«(yjAi)  and  {Gn(y|A)|A  €  A} 
forms  a  stochastically  increasing  family  for  each  n. 

A  preference  zone  will  be  defined  in  Cl  by  means  of  a  function 

p  :  A  — ►  3 ft, 

where  3ft  ia  a  real  line,  such  that 

1.  p{-)  is  continuous  and  non-decreasing  on  A 

2.  p(A)  <  A  V  A  €  A 

3.  p  :  A'  ^  A,  where  A'  =  {A  €  A|p(A)  6  A}. 

Define 

fi(p)  as  {A  €  fl|  A{k_i]  <  p(A[fc])} 

and 

to°(p)  =  {A  €  fl|  A{1j  =  A[*_ij  =  p(Aw)}  . 

Let  An(-)  be  a  sequence  of  functions  such  that  for  each  n 

MO  :  En  — *  3? 

where  Ua€a£«  C  En,  satisfying 

1.  For  each  n  and  x,  hn(x)  >  x, 

2.  For  each  n,  hn(x)  is  continuous  and  strictly  increasing  in  x. 

Typical  examples  of  MO  are  given  by 

kn(x)  =  x  +  dn  (dn  >  0) 
for  the  location-type  procedures  and 

A„(3/)  =  CjiX  (cn  >  1) 
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for  the  scale-type  procedures. 

The  goal  of  the  experimenter  is  to  select  a  best  population.  The  experimenter 
restricts  consideration  to  procedures  ("P)  which  guarantee  the  probability  requirement 

PX[CS\V]  >  P’  VAefi(p)  (75) 

where  p(-)  and  Pm  are  specified  prior  to  experimentation.  The  event  [CS]  occurs  if  and 
only  if  the  experimenter  selects  a  best  population. 

Here  we  propose  an  elimination  type  two-stage  procedure  V2  for  selecting  a  best 
population  using  a  restricted  subset  selection  rule  in  its  first  stage  and  an  indifference 
zone  approach  in  its  second  stage. 

Procedure  V2, 

Stage  1:  Take  n i  independent  observations 

from  each  x,-,  t  =  1, . . . ,  Jfe,  and  compute  the  k  estimates 


Determine  the  subset  /  of  (1, . . . ,  k}  where 

l  -  {•!<’  >  [!*.,»  }’ 

and 

denotes  the  ordered  Denote  by  xj  the  associated  subset  of  {wi, ...  Wfc}. 

1.  If  xj  consists  of  one  population,  stop  sampling  and  assert  that  the  population 
associated  with  T^]ni  is  best. 

2.  If  x/  consists  of  more  than  one  population,  proceed  to  the  second  stage. 

Stage  2:  Take  n2  additional  independent  observations  Xjf\  j  =  l,...,n2,  from 
each  population  in  x/,  and  compute  the  cumulative  estimates 

TniXl? . XjllX™ . *£>)  =  Jin  fori  ei, 
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where  n  =  n*  +  n2.  Assert  that  the  population  associated  with  maxt€/  Txn  is  best  using 
randomization  to  break  ties  if  necessary. 

There  is  an  infinite  number  of  combinations  of  (ni,na,  hni)  for  given  k,  m,  Pm  and 
p(-)  guaranteeing  the  required  probability  condition  (75),  and  different  design  criteria 
lead  to  different  choices. 

Let  S'  denote  the  cardinality  of  the  set  I  in  the  first  stage  of  procedure  V2  and  let 

"  „  f  0;  ifS'  =  l  ,  , 

S  =  (76) 

S';  if  S'  >  1. 

Then  the  total  sample  size  required  by  n,  TSS  say,  is 

TSS  =  krii  +  Sn2. 

Let  E%[TSS\V2]  denote  the  expected  total  sample  size  for  V2  under  A.  To  make  a 
choice  of  (ni,n2,  hni)  as  well  as  to  have  the  total  sample  size  TSS  small,  we  adopt  the 
following  minimax  design  criterion. 

For  given  k ,  m,  p(-)  and  P*  choose  (ni,n2,hni)  to 

minimize  sup^j[T55|'p2] 

A€0 

subject  to  inf  Px[CS\V!A  >  P* ,  (77) 

fcn(f) 

where  (nl}n2)  are  non-negative  integers  and  /»„,  is  a  real  function  defined  as  before. 


5.3  A  lower  bound  on  the  probability  of  a  correct  selection 

The  so  called  LFC  of  the  population  parameters  for  general  two-stage  procedures  has 
not  been  determined  yet.  Moreover,  even  if  the  LFC  of  the  population  parameters  were 
known,  the  problem  of  evaluating  the  probability  of  a  correct  selection  associated  with 
P2  when  the  population  parameters  are  in  that  configuration  would  still  remain. 

However  it  is  possible  to  determine  a  set  of  constants  {ni,n2}  hni)  (although  not 
the  best  set)  to  implement  V2  if  a  lower  bound  to  the  probability  of  a  correct  selection 
can  be  found  and  the  LFC  of  the  population  parameters  can  be  determined  for  that 


lower  bound.  Such  a  set  of  constants  provides  a  conservative  solution  to  the  problem 
since  it  overprotects  the  experimenter  with  respect  to  the  probability  requirement  (75), 
this  overprotection  being  purchased  at  the  expense  of  an  increase  in  E^[TSS\V!^.  In 
this  section  we  consider  the  problem  of  a  lower  bound  on  the  probability  of  a  correct 
selection  and  the  infimnm  of  the  lower  bound. 

First  we  derive  a  lower  bound  for  P^CS\VQ  in  Theorem  5.1.  This  lower  bound  is 
particularly  useful  since  it  achieves  its  infimnm  over  Q(p)  at  Q°(p).  This  result  permits 
us  to  construct  a  conservative  two-stage  procedure  which  guarantees  the  probability 
requirement  (75). 

Lemma  5.1  For  any  X  €  Cl,  we  have 


PxlTllU  >  ] 

-EE/  n  <Wa) 

p-k-m  °°  jeSS(k) 

■  II  {^(M»))  -  <*?(»))<«?!;>(!,) 


pj[tw.  >  TWn,  j<t]=r  ff  G«(,)dG<.‘>(»), 

J—oo  •  , 


where  n  =  r»i  +  n2, 


{s;(0.  i  =  i (V)> 


denote  the  collection  of  subsets  of  size  i  from 


(78) 

(79) 


sjw = «(0  -  sj(0 

and 

<%'(  v)  =  G.fe|AM). 

Proof 

The  proof  of  Lemma  5.1  is  in  Santner  (1973).  □ 


Theorem  5.1  For  any  A  6  (l,  toe  have 


■  II  {gS?(^(>))-g!?(»)}<k3!!,(!() 

+  rnM-i.  (so) 

J-°°  3=  1 

Proof 


PxlCSm  =  PS|^>m«{I^M1Iw,A->(^)}, 
r(*)»  >  To).] 

>  >  max(^‘2  m+l]r»i »  <'(4h,)}  ■ 

>  /we.  s  i 

+^xP(*)»  ^  r(i)n»  i  <  *]  -  !>  (81) 

and  hence  the  result  comes  from  Lemma  5.1.  □ 

Next  the  infimum  of  the  lower  bound  will  be  considered  in  Theorem  5.2.  Lemma  4.1 
due  to  Mahamunulu  (1967)  and  Alam  and  Rizvi  (1966)  will  be  needed  again. 

Lemma  5.2  For  any  A  €  fl(p),  we  have 

PxKU  2  „)}]  =  mf 

where 

*.(A,n.)  .=  F  {Gn(hn(y) WA))}‘-‘ 

J-oo 

•f  < k-m' m}dG - (m 


and 


inf  >  T(j)n,  j  <k)=  inf  ^2(A,n), 


=  /  {G„(sWA))}‘-VG.(!,|A). 

J  — oo 


Proof 


The  first  part  of  this  lemma  was  proved  in  Santner  (1975).  To  prove  the  second 
part,  it  suffices  to  show  that  for  all  A  €  fl(p), 


Define 


Then 


Px[T(k)n  >  T(j)n,  j  <k)>  inf(’52(A,n). 


I  0;  otherwise. 


PX lrW"  -  r0)»»  3  <k\  =  %[,?(T)]- 


By  Lemma  4.1,  it  suffices  to  show  that  jj(T)  is  non-increasing  in  T(/)n  for  all  l  <  k.  Let 
us  define  T'  such  that 

T\i)n  >  T(i)n  and  T\j)n  =  T(,)„  V  j  ±  /,  j  <  k. 

Then  it  suffices  to  show  that  jj(T)  =  0  implies  ^(T7)  =  0.  Suppose  that  r?(T)  =  0. 
However, 

’j(T)  =  0 


if  and  only  if 


and  this  implies  that 


r(*)n  <  T(j)n  for  some  j  <k 


T(*)„  <  T(/)n  ==►  T(k)n  <  T[l)n 


T(k)n  <  T(i)n I  j  ^  /  =►  T(k)n  <  T{j)n,  j  ±  l. 


Hence,  both  cases  imply  tj(T)  =  0.  So  we  get 

-  r(i)“>  i  <  *1 

>  r  {G.(»WAw))}*-VG„(!,iVi) 

«/— oo 

=  'MAw,n) 

which  completes  the  proof.  □ 

From  Theorem  5.1  and  Lemma  5.2,  we  can  get  the  following  result  about  the 
irfimnm  of  the  lower  bound. 

Theorem  5.2  For  any  X  6  0(p),  we  have 

inf  Px[CS\V%  >  inf  ^(A.m)  +  inf  tf2(A,n)  -  1,  (82) 

X€o(f)  A  1  ”  a«a*  v  '  >€A' 

where  liri(A,n1)  and  ,®r3(A,  n.)  are  defined  in  Lemma  5.2. 

Remark  5.1  For  the  special  cases  of  location  and  scale  parameter  problems,  the  infi- 
mum  of  the  lower  bound  is  independent  of  X. 

(1).  Location  parameter  case;  In  this  case, 

Gn(x)  =  G„(x  -  A),  -oo  <  x,  A  <  oo, 


the  usual  choice  of  hn(-)  is 


hn(x)  —  x  +  dn ,  dn  >  0, 


and  the  preference  zone  is  given  by 


p{\)  =  A  -  6,  6>  0, 


that  is, 


ft(p)  =  (1(5)  =  {A|A[*|  -  A[*_i]  >  £}. 

Then  'I,i(A,ni)  and  ^a(A,n)  are  given  by 

®i(A,ni)s=«i(M„i,»i)  =  J  ^{Gnxiy  +  dni  +  t>)}k~1 

<83> 


*a(A,n)  =  #a(tf,n)  =  /  {Gn(y  +  ^)}*-1<iGn(y), 

«/— oo 


token  Gn(-)  is  the  cdf  of  T,n  when  A,-  =  0. 
fiy.  Scale  parameter  case;  For  this  case, 


Gn(x)  =  Gn(x/A),  x  >  0,  A  >  0, 


the  usual  choice  of  hn(-)  is 


A*(x)  —  ^  1» 


and  the  pnfennce  zone  is  given  by 

p(\)  =  \/6,  6>  1, 

t/iat  is, 

fl(p)  =  ft(5)  =  {A|A[*j  >  ^A^—x]}. 

Then  ®i(A,ni)  and  \t3(A,n)  are  given  by 

*i(*,'>i)  =  *i(<,=«„ni)  =  {G„1(cI,1{y)}*-1 


«,(A,n)  =  *3(«,n)  =  [°°  {G^Sy^-'dGM, 

J— oo 


when  (?»(•)  is  the  cdf  of  T,n  when  A,  =  1. 

5.4  Expected  total  sample  size  for  V2 

In  order  to  solve  the  non-linear  optimization  problem  ( 77  !wr  *v 
expression  for  the  E%[TSS  17^]  and  then  d;:iemum»  sup:*.  1 
total  sample  size  TSS  can  be  written  as 

TSS  ~Kn  ♦  "  . 

where  S  is  defined  in  (76)  The  result  <  <>n.  e~:  s  ..  < 
is  summarized  in  the  following  ttie..r»-r: 
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i  /vn 

i ;  / 1 

Theorem  5.S  For  any  X  £  Cl,  we  have 


k  h  1  (*”*) 

BjETSSre]  -  *«,  +  »,£[  E  E  /”  II  «!?(») 

iml  pmh-m  vml  jeSg(t) 

•  II  {<W(KM)-4?(t))<iGi!(r) 

-  r  n  Gtf(*i'(»))‘wsj(»)i- 

J-<x>  jml 

j¥‘ 

Proof 


EX[TSS\V%  =  f?x[(jbn1  +  na5)|^] 

—  Jmi  +  n3jE?j[5|^] 

=  kni  +  n2{Ex[Sm)  -  Px[5'  =  10$]}. 

Now  for  any  A  6  fl, 

*  n  (*;■)  ™ 

-EE  E  jT  n  oj?(r) 

»»1  pmh-tn  irm  1  °°  jeS£(i) 

•  n  w(*^(f))-ou(f)}«s!(f) 

ieSTw 

from  Theorem  (5.1)  in  Santner  (1975).  Hence  it  suffices  to  show  that 

px[5'  =  irn  =  E  r  n  G@(^(y))^(y)- 

<-l  fl 

Now 


Pj[S'  =  1|7^J  =  P^[exactly  one  population  is  sclected\V2) 
k 

=  ^2  PX[T(*)  *s  ^e  only  one  selected]. 

iml 

However, 

[x(j)  is  the  only  one  selected] 


iff 


95 


(87) 


(88) 


(89) 


(90) 


1 


’al  '.I  .I  .» 


Hece  the  result  in  (90)  holds.  □ 

Next  we  will  consider  the  maximum  value  of  E^[TSS\V^]  over  Q.  Conditions  which 
guarantee  that  the  supremum  of  £j[TSS|P£]  in  ft  occurs  at  some  point 

A  =  (Aj,  •  •  •  i  Air) 

for  which  A^j  =  A[*j  are  given  in  Theorem  5.4  and  a  general  expression  of  the  supremum 
of  E^[TSS[P^[  in  ft  under  these  conditions  is  given  in  Corollary  5.1. 

The  following  regularity  conditions  will  be  assumed  in  some  of  the  theorems  that 
follow. 

(i) .  E*  =  £,  for  all  A  €  A. 

(ii) .  For  any  [A1?  A2]  C  A  there  exists  e(y)  possibly  depending  on  Ax  and  Aa  such 
that 

\dGn(y\X)/d\\  <  e(y)  V  A  €  [Ai,Aj], 


where 


{f°  e(y)dG^hni(y)\Xl)}{r  e(y)JG.(y|A')}  <  oo  V  A'  >  Aa. 

J—OQ  J— OO 


Santner  (1975)  proved  the  following  lemma  in  which  conditions  are  given  which  guar¬ 
antee  that  the  supremum  of  in  ft  occurs  at  some  point 

A  ■  (Aj, ..M  A|) 

for  which  A(ij  =  Aj*j  where  S'  is  the  cardinality  of  the  set  /  in  the  first  stage  of  PJ. 

Lemma  5.3  If  regularity  conditions  (91)  are  satisfied  and  for  all  Ai,  Aa  in  ft  with 
Xx  ^  Aj 


_  3gaklM».(J^(,)| |A,)  >  0  a.t.  in  E*,, 


then  E$S'\VQ  is  non-deereasing  in  A^j  on 


A(A(„)  =  {A  €  A|A  <  A(jj} 


for  any  fixed  (Apj, . . . ,  Aw). 


We  will  consider  conditions  which  guarantee  that  the  infimum  of  Pj[S'  =  1|PJ] 
occurs  at  some  point 

A  *  (Aj,  . . . ,  As) 

for  which  A^j  =*  A[sj  in  the  following  lemma. 

Lemma  5.4  If  regularity  conditions  (91)  art  satisfied  and  for  all  Aj,  Aa  in  ft  with 
Aj  ^  Aa 


ag-ftZfaOIAQ 

d\i 


^t(y|Aa) 


-  <  0  «•«•  in 

then  Pf[S '  —  l|Pj]  is  non-increasing  in  A^j  on 

A(AW)  *  {A  €  A|A  <  AW} 

for  any  fixed  (Apjl . . . ,  Aw). 

Proof 

Fix  Apj, . . . ,  A[s]  for  the  following  argument.  Then 


where 


and 


pxl5#-i|P5]«r,(X)  +  ra(A)t 

!•■(*)-  L  n  gwwvwgzhv) 

JK*\  jm a 

r,(X)  -  E  /  n  Cif (*S(»))«S!(r). 

1 _ O 


m 

Now  r,(X)  can  bo  rewritten  aa 

ta%)  .  zL  oa’(*?(»))nGtf <*?(»))<«!?(») 
-  E/.  cg’W*))  n 


(93) 


Next  integrating  2\(X)  by  parti  we  obtain  that 


7\(A)  =r  constant  with  respect  to  A{i] 

-  L  E 

*•  a 

*  constant  toitA  respect  to  A^j 

-  L  oewz:c(?(*s(»)Wf(*sw)^t?U- 

*-*  ®y 

m 

Hence  combining  terms  it  follows  that 


pa^  -ira 

»  constant  tottA  respect  to  A^j 

+E/_  MWW) 

Wv*<>  J-* 
in 


and  finally 


adgi^iiai 

Aii 

*  ,  » 


E  A  n  <*?(*;?(»))  •  (aG<^(S(l'))jg(y) 

tmaJK*i  jmt  aAfr) 


But  (93)  gives,  for  every *  . . . , A, 

gggW(»)) 


dAw 


-yS?(y) 


and  hence  (94)  is  non-positive  and  this  completes  the  proof.  □ 


(94) 


Remark  5.2  We  can  easily  shot)  that  (9S)  and  (99)  art  equivalent  by  using  the  trans¬ 
formation  t  *  h^)(y)  in  (99).  Hence  the  conditions 

jMW>0 

<« w  ~ 

jfidgl=ilSt<o 

- 

ore  also  equivalent. 


From  Lemma  5.3,  Lemma  5.4  and  Remark  5.2  we  can  get  the  following  theorem  in 
which  conditions  guaranteeing  that  the  supremum  of  EX[TSS\VQ  in  0  occurs  at  some 
point 

X  *  (Aj,  * . . ,  An) 

for  which  Apj  *  A{*j  are  given. 

Theorem  5.4  Suppose  that  the  regularity  conditions  (91)  are  satisfied  and  for  all 

Ai,  A*  m(I  with  X\  <  A) 

-  !^1hW»)W^!!>o  «■»*,■  m 

Then  EtfTSSyPQ  is  non-decreasing  in  Ajjj  on 

A(Apj)  *  {A  €  A|A  <  Ajjj} 

for  any  fined  (Apj, . . . ,  Aw). 

Proof 

By  noting  that 


£j[TSS|PJ]  -  fcn,  +  nt{Ex[Sm)  -  PX[S'  -  1|PJ]} 
the  result  of  the  theorem  is  dear  from  Lemma  5.3,  Lemma  5.4  and  Remark  5.2.  □ 


Remark  5.3  Condition  (95)  reduces  to  the  requirement  of  MLR  in  the  locution  or 
seele  paremeter  problems. 


A  general  expreeiion  o i  the  snpremum  of  the  E$rSS\VQ  in  fl  under  the  condi¬ 
tion  (95)  ie  given  in  the  following  corollary. 


Corollary  5.1  For  every  fixed  Apj  <  . . .  <  Apj,  if 

All  - 


for  X^)  in  A(Apj],  then 

where 


nwp^xCTSSl'Pa  *  •up7(A,n1), 
Xeo 


7(A,»i) 

-<G.,(*?aoiA)}‘-I)<«..(rlA).  '  (*•) 


Furthermore,  if  the  hypotheses  of  Theorem  54  hold  for  Xx  m  Aa,  then  7(A,m)  is  non - 
decreasmy  ra  A  end  hence  if  there  is  e  greatest  dement  Ao  €  A,  then 

■op  e^tssypq  * 

Xeo 

Proof 

Thie  corollary  will  be  proved  by  using  the  following  three  lemmas  and  Remark  5.2. 
□ 

Lemma  5.5  For  every  fixed  Ap]  <  . . .  <  Apj,  if 

<&12S1>0, 

then 

sup  EAS'YPi]  ■  supTiC^.n!), 

Xeo  *K 


100 


where 


7l(A,«i) 

-  k  j  {G„,  (,)|A)>‘-'/{s^i^;  i  -  m,  mJiC.,  (,|A).  (97) 

Furthermore  if  (95)  holds  for  Aj  =  A3  then  7i(A,t»i)  is  non-deereasing  in  A  and  hence 
if  there  is  a  greatest  element  Ao  6  A,  then 

aup£;I[5'|^]  =  7i(Ao,n1). 

Uq 

Proof 

Santner  (1975)  proved  this  lemma  by  using  the  following  lemma  due  to  Gupta  and 
Panchapakesan  (1972)  which  gives  sufficient  conditions  for  the  monotone  behavior  of 
the  7i(A,r»i)  and  hence  the  proof  will  be  omitted.  □ 


Lemma  5.4  Let  {/’(•|A)|A  €  A}  he  a  family  of  absolutely  continuous  distributions  on 
the  real  line  with  continuous  density  /(*|A)  and  ¥(x,A)  a  bounded  real  valued  func¬ 
tion  possessing  first  partial  derivatives  and  with  respect  to  x  and  A  respectively 
and  satisfying  regularity  conditions  (99).  Then  £a[4i(x,  A)]  is  non-decreasing  (non¬ 
increasing)  in  A  provided  for  all  A  6  A 


(98) 

The  regularity  conditions  for  Lemma  5.6  are  given  as  follows. 

(i) .  For  all  A  €  A,  is  Lebesgue  integrable  on  ft. 

(ii) .  For  every  [Ai,  Aa]  C  A  and  As  €  A  there  exists  h(y)  depending  only  on  A,,  t  = 
1,2,3  such  that 

*/  .«  ,$•(*, A)  dF(xlA) d«(x, A3)  ,, 

/(*|AS) — Jj-4 - -  <  A(x)  VA€  [A,,  A,]  (99) 

and  h(x)  is  Lebesgue  integrable  on  ft. 

Lemma  5.7  If  for  every  fixed  Apj  <  . . .  <  A[*j, 


for  Ajjj  m  A[AIaJ],  then 


where 


kWxP'  -  iim  =  &  lit*.".), 


•»(*,».)  -  *  jL  <a,(*S(»)W}*-1«»,(»|A).  (100) 

Furthermore,  if  the  hypotheses  of  Theorem  5.4  hold  for  \\  =  Aa<  then  -fe(A,ni)  is 
non-increasing  in  A  and  hence  if  there  is  a  greatest  element  Ao  €  A,  then 


^X«O^#=l|^]  =  72(A0,ni). 


Proof 


It  suffices  to  prove  for  all  q  <  k  and  fixed 

A{f+i]  <  •  •  ’  <  AW 

that 

PhJS' = 

is  non-increasing  in  A  on  AfA^i]]  where 

A(t) a  (A, . . . ,  A,  A^i], ...| Ajj,]). 

Let 

A  *  (Apj,  •  •  •  *  ^]) 

and  note  from  (90)  that  P^{S'  =*  lj is  invariant  under  permutations  of  the  ele¬ 
ments  in  X'.  So 

-  m)  m  ±  dpx,{s> = nn) 

5Aw  X(«) 


aAPl  X(,)’ 


But  from  the  proof  of  Lemma  5.4, 


S'*  1 


r.  iV  *  rr 


Hence  the  infimum  over  0  of  the  -Pj[5"  —  1| VQ  occurs  at  some  point  where  all  the  Apj’s 
axe  equal.  Since 

‘*(Afm)  =  2?[*(y,A)] 


for 

*(,,A)  -  «’(»)! A)}*'1, 

Lemma  5.6  can  be  applied  and  the  sufficient  condition  (98)  that  tj(A,  rix)  be  non- 
increasing  reduces  to 


v)IA) 
dx 


9* 


dG. 


d\ 


<  0  V  A,  o.e.  y. 


The  final  part  of  the  result  is  obvious.  □ 

Remark  5.4  For  the  cases  of  location  or  scale  parameter  problems  the  supremum  of 
the  EX[TSS[PQ  in  fi  is  independent  of  X  provided  the  conditions  in  Theorem  5.4  and 
Corollary  5.1 .  Under  the  same  framework  of  Remark  5.1,  we  have 
(1).  For  the  location  parameter  case: 


sup  EX[TSS\7>$ 

X«Q 

=  tni  +  *n,^HGw(»  +  <t„  )}*■'/{ g^J^y;t-m,m} 

~{G«  (» -£)>*-■]<«,(»),  (101) 


where  (?«(•)  is  the  cdf  of  the  Tin  when  X < 
(2).  For  the  scale  parameter  case: 

•  sup  £j[TSS|P|] 


*"l  +  **1  -  n>,m) 

-{G^vM}l-lvaK(y), 


(102) 


where  (?„(•)  is  the  cdf  of  the  T*,  when  Aj*l. 


5.5  An  optimization  problem  and  the  performance  of  V2 

In  this  section  we  first  consider  the  optimization  problem  (77),  which  one  must  solve 
in  order  to  determine  (ni,na,  An,)  which  is  necessary  to  implement  V2  and  then  we 
consider  the  performance  of  V*2  relative  to  the  corresponding  single-stage  procedure  in 
terms  of  the  total  number  of  sample  sizes  needed. 

As  we  noted  in  Section  5.3,  the  LFC  of  the  parameter  vector  A  in  Q(p)  has  not  been 
determined  in  the  general  case  and  hence  we  replace  the  exact  infj€0  Pj[C5|7^]  by  the 
conservative  lower  bound  on  that  probability  given  by  the  right  hand  side  of  (82).  For 
the  special  case  of  location  parameter  problems  under  some  appropriate  conditions  the 
optimization  problem  (77)  can  be  written  as  follows. 

For  given  k,  m,  6  and  P*  choose  integers  nx  and  n2  and  a  real  dni  >  0  to 

minimize  kn2  +  kn2  £j{G* tiv  +  )}*”**{  k-m,m} 

-{G^y  ~'Z)}k~l]dGM 

subject  to  r°  {G*,  (y  +  dni+  <5)}*_1 

J— 00 

+£V<-.+« ,(y  +  <)}*-'*?, w+^,(»)  - 1  >  P\  (103) 

where  G»(-)  is  the  cdf  of  the  when  A*  =  0. 

For  the  case  of  scale  parameter  problems  under  some  appropriate  conditions  the 
optimization  problem  (77)  can  be  written  as  follows.  For  given  fc,  m,  6  and  P*  choose 
integers  r»i  and  n2  and  a  real  c*,  >  1  to 

minimize  knx  +  kn2  J^J\{Gni{cniy)}k-lI{7^^\  k-m,m} 
-{G^vM^dGM 
subject  to  r  {Gni(cni6y)}k~1 

flO 

I { k  -  m,m}dGn,(y) 

+  /^{G,„+W,(i»))‘-MGW+W,(v)  - 1  >  r,  (104) 

where  G„(-)  is  the  cdf  of  the  7*  when  A<  =  1. 


As  a  measure  of  the  efficiency  of  relative  to  that  of  the  corresponding  single-stage 
procedure  when  both  guarantee  the  same  probability  requirement  (75),  we  consider  the 
ratio,  termed  relative  efficiency  RE), 

Jfcn, 

where  n,  is  the  sample  size  needed  in  the  single-stage  procedure. 


5.6  Applications 

In  this  section  we  apply  the  results  of  previous  sections  to  a  problem  of  selecting  the 
population  with  the  largest  mean  from  k  univariate  normal  populations. 

Suppose  that 

Ti~  N(w,<r2),  i  =  l,...,fc, 

where  the  common  variance  a1  is  known  and  the  experimenter  is  interested  in  selecting 
the  population  having  largest  p,.  We  take 


1  n 
i 


A  tmp,. 

nJml 


Then 


where  9  is  the  cdf  of  a  N( 0, 1)  random  variable.  Since  this  is  a  location  parameter 
problem,  we  take 

M  =  8>0 

and 


ha 


so  that 


n(p)  =  {p\n[k)  -  >  s}- 


Noting  that  the  distribution  of  the  mean  of  a  sample  from  a  normal  population  has 
MLR  with  respect  to  the  location  parameter,  and  using  Theorem  5.2  and  Remark  5.1 


it  can  be  seen  that 


>  r  {*(»+* +«^?/<r)}‘-' 

</— oo 

7  <  *~m>  m)<»(y) 

+  ^{$(y  +  ^Vni  +  na/<T)}*-1<i$(y)  -  1, 


(105) 


and  using  Corollary  5.1  and  Remark  5.4, 


supEplTSSlPQ 

fiea 


=  kni  +  kn2j^J[{9(y  +  h)}k~lI{^^]k  —  m,m} 
-{*(V  "  h)}*-l)d*(y). 


(106) 


Hence  the  conservative  non-linear  optimization  problem  can  be  reduced  to  finding 
integers  ni,n2  and  a  real  number  k  >  0  to 

minimize  kni  +  kn2  j°°  [{$(y  +  h)}*~l/{yffi{y;  k-m,m} 

-{♦(V  ”  h)7~'mv)  (107) 

subject  to  J~  {$(y  +  /» +  Sy/n^/a)}*'1 

+  /  {*(y  +  tyn\  +  n7/er)}k"ld9(y)  -  1  >  P*,  (108) 

for  the  given  values  of  £,  m,  <5  and  P*. 

Table  30,  Table  31,  Table  32  and  Table  33  contain  the  real  valued  solutions  (ni,n2,  A) 
of  the  above  optimization  problem,  which  are  necessary  to  approximate  the  con¬ 
stants  (r»i,na,  h)  needed  to  implement  V2  for  P*  =  0.75,0.90,0.95,0.99,  k  =  3,4,5, 
m  *  2,3,4  <  k  and  S/a  —  0.1, 0.5, 1.0, 2.0, 4.0.  All  computations  were  carried  out  in 
double-precision  arithmetic  on  a  Vax-11/780.  The  source  program  in  Fortran  for  the 
SUMT  algorithm  given  by  Kuerter  and  Mize  (1973)  was  used  to  solve  the  non-linear 


optimization  problem  and  Gauss-Hermite  quadrature  with  twenty  nodes  was  used  to 
compute  the  integrals. 

Using  above  constants,  we  can  define  an  elimination  type  two-stage  procedure 
as  follows; 

Stage  1:  Take  ni  independent  observations 

xfj \  j  a 

from  each  x,-,  t  =  1, . . . ,  Jb,  and  compute  the  k  sample  means 


x£=r-i:4f).*'= i.-.*- 


Determine  the  subset  I  of  {1, . . . ,  k}  where 


>  max 


where 


*&.*•••**&. 

denotes  the  ordered  Denote  by  x/  the  associated  subset  of  {xlt . .  .x*}. 

1.  If  X/  consists  of  one  population,  stop  sampling  and  assert  that  the  population 
associated  with  is  best. 

2.  If  x/  consists  of  more  than  one  populations,  proceed  to  the  second  stage. 

Stage  2:  Take  n3  additional  independent  observations  Xjj\  j  =  l,...,nj,  from 
each  population  in  x/,  and  compute  the  cumulative  sample  means 

xi.=i(i;4)+x:4;,)v<€/, 

n  jm  1  i«l 

where  n  =  rii+nj.  Assert  that  the  population  associated  with  maXi€/Xjn  is  best  using 
randomization  to  break  the  ties  if  necessary. 


The  relative  efficiency  RE  of  the  two-stage  procedure  relative  to  the  correspond¬ 
ing  single-stage  procedure  is  given  by 

A®  =  TT“[^i  +”2^1  I™  {  5Z  II  *(>  + 

*»>,  pmk—m  <«i  j€sg(i) 


(109) 


*  n  {*(?  +  h  +  y/hlSij/a)  -  9(y  +  y/OiSij/*)} 

ie9£(0 

-  II  *(?  -  *  +  >fii6ij/<r)}d9(v)], 

53 


where  (r»i,  n3,  k)  is  the  real  valued  solution  of  the  non-linear  optimization  problem  (107) 
and  (108), 

*«>  =  /*p|  - 

5J(»)  and  3£(»)  are  defined  as  in  Lemma  5.1  and  n,  is  the  real  solution  to 

h-l 


r  {*(*  +  \fc*l°)}  "  d9(z)  =  P*. 

OO 


Of  course,  JtE  depends  on  S  and  P*  as  well  as  Ji. 

Table  34  and  Table  35  contain  the  values  of  the  relative  efficiency  for  the  two  special 
cases,  namely  the  equally  spaced  and  slippage  configurations,  for  P *  =  0.75, 0.90, 0.95, 0.99, 
k  —  3,4,5,  m  =  2,3,4  <  k  and  6/<r  =  0.1, 0.5, 1.0, 2.0, 4.0.  All  computations  were  car¬ 
ried  out  in  double-precision  arithmetic  on  a  Vax-1 1/780  and  Gauss-Hermite  quadrature 
with  twenty  nodes  was  used  to  compute  the  integrals. 

From  Table  34  and  Table  35,  we  see  that  for  both  configurations  the  values  of  RE 
are  less  than  or  equal  to  one  except  for  some  smaller  values  of  k,  m  and  P*.  Hence  the 
two-stage  procedure  is  more  efficient  than  the  single-stage  procedure  in  terms  of  the 
expected  total  sample  sizes.  Furthermore,  the  effectiveness  of  the  two-stage  procedure 
appears  to  be  increasing  as  each  of  k,  m  and  P*  increases  for  fixed  values  of  the  others. 


Table  30:  Constanta  to  implement  the  two-stage  procedure  V%  for  selecting  the  largest 
normal  population:  P*  =  0.75. 


P*  m  0.75 


OLIO 

03135*403 

07381*403 

03391*401 

0785194*403 

050 

Om  90*4-01 

03811*401 

04964*401 

0314109*403 

1.00 

03150*4-01 

07019*400 

04907*401 

0.785374*401 

2.00 

015373*400 

01758*400 

04998*401 

0196318*401 

4.00 

01343*400 

04397*01 

04999*401 

0490786*400 

aio 

0^2445*403 

03163*403 

01883*401 

0138739*404 

050 

09004*401 

07900*401 

04974*401 

0558164*403 

\xo 

04480*401 

01973*401 

04998*401 

0130041*403 

3.00 

00319*400 

04941*400 

04909*401 

0347603*401 

4m 

01555*400 

01334*400 

05001*401 

0868007*400 

010 

03043*403 

01386*403 

03474*401 

0119643*404 

050 

08110*401 

05118*401 

04964*401 

0478336*403 

1.00 

03039*401 

01381*401 

04887*401 

0119657*403 

3.00 

05073*400 

03300*400 

04996*401 

0398891*401 

4.00 

01388*400 

07897*01 

04989*401 

0747339*400 

aio 

03717*403 

03554*403 

02348*401 

0108350*404 

050 

01007*403 

01343*403 

04998*  101 

0706946*403 

1.00 

03745*401 

03099*401 

04999*401 

0199336*403 

3.00 

08882*400 

07753*400 

05003*401 

0498091*401 

4.00 

01715*400 

01839*400 

05000*401 

0134533*401 

010 

03388*403 

03159*403 

03957*401 

0177798*404 

050 

08038*401 

08648*401 

04985*401 

0711354*403 

1.00 

03359*401 

03164*401 

04996*401 

0177839*403 

3.00 

05450*400 

05403*400 

05003*401 

0.444596*401 

4.00 

01413*400 

01350*400 

05001*401 

0111149*401 

010 

01894*403 

01885*403 

03067*401 

0169006*404 

050 

07837*401 

07333*401 

0307 i;  401 

0676307*403 

1.00 

01980*401 

01780*401 

04957*401 

0260333*403 

3.00 

04906*400 

04444*400 

04969*401 

0433056*401 

<00 

01335*400 

01113*400 

04906*401 

0106764*401 

.  i  ,  iVtu  t  *  -  i.1  «»  u.  * 


Table  31:  Constanta  to  implement  the  two-stage  procedure  “PJ  for  selecting  the  largest 
normal  population:  P*  »  0.90. 


rgyglgT  u.wo. 


F*  m  0.80 


oasir*403  01081*401 
08030*401  OMMtHB 

01373*401 
ft  wain  oo 

ft  1331  *400 


0883011*408 

oisn  70*400 


* 


fti«m>m 
01831*400  I  0.100*1  01 


03381*400 

03081*400 


&32S0*H» 

01303*408 

03007*401 

ftrrsr*400 

0.180*400 


0  4030*400 

o-iraii  i  oa 


0.4887*401 


ft  41 88*401 


EL 


0.1418*401 


0l34S878*400 

004188*401 

0183847*401 


0338874*404 

0818884*400 

0381187*408 

0877884*401 

0144488*401 


0l3384S7*4O4 

0133730*408 


01388*488 
03473*401 
0.88801 1  00 
018884  100 


0.4483*401  08001*401  |  0334330*408 

01114*401  01008*101  ' 

03787*400  08001*401 


04138*400 

01808*108 

04031*401 

01008*401 

03813*400 


04088*403  01887*401 

01180*  I  03  01083*401 

03800*401  01883*401 

08008*400  01884*401 

03480*400  O  WOO  *  I  01 


01313417*404 

0138303*408 

0313488*408 

0783844*401 

0188011*401 


0313107*404 

0138341*403 

0313108*403 

0783780*401 

0188308*401 


Table  32:  Constant*  to  implement  the  two-stage  procedure  for  selecting  the  largest 
normal  population:  P*  *  0.95. 


Table  33:  Constanta  to  implement  the  two-stage  procedure  for  selecting  the  largest 
normal  population:  P*  *  0.99. 


09837*409 

asrssi  i  oa 

01397*401 

0  300034*4  04 

09708*408 

aaassi  i  o» 

01394*401 

0148307*409 

asassi  i  oi 

04730*401 

01300*401 

0301010*408 

01347*401 

dlMlafOl 

0090110*401 

01739*400 

03117*400 

O4soo*40i 

0394089*401 

aioM*w 

osru*408 

aiassi  i  <n 

0034990*404 

a«oo»toa 

03877*408 

aisasiim 

0310094*409 

oicns*4aa 

09309*401 

0090891*403 

o.3sa»»»oi 

anss»ioi 

asom*40i 

0193070*403 

09393*400 

0.38QSi  |  00 

0.931 430*«0I 

assao*409 

amoi*»o9 

alias*  im 

0017079*404 

03439*403 

oaan*408 

01*79*401 

03070894  109 

0.0003»401 

07997*401 

a  1177*4-01 

0017073*403 

0.3141*401 

oi7to*4Ci 

aisroMOi 

01399091 1  08 

04931*400 

04070*400 

a  isss>  ioi 

0390470*401 

01094*404 

07190*400 

ai»3a«  >  oi 

00000*0*404 

04909*403 

aasssi  i  os 

0*037*401 

0370433*4-09 

aiossii  oa 

07334*401 

0^007*4-01 

0009111 1  1  08 

03730*401 

oiso7*40i 

01001*401 

0173199*403 

00000*400 

04017*400 

04000*401 

0490947*401 

asssoiioa 

0.9004*409 

aiil  4*401 

0099944*404 

assn  *hb 

03319*403 

01810*401 

0300037*409 

09040*401 

0.00*0*401 

04000*401 

0009030*403 

03190*401 

osoisa+oi 

04007*401 

0.107307*403 

00340*400 

o  imt»  i  on 

(hMOOtfOi 

ansin*40i 

o.sss>«i  oa 

07841*403 

01019*401 

0000010*404 

0.9898*403 

03144*403 

01031*401 

0373009*409 

assn  *+oi 

otss9*40i 

01030*401 

P 

I 

I 

03310*401 

01994*401 

01033*401 

0170004*403 

00037*400 

a 4009*4-00 

01030*401 

0430009*401 

1.00 

3.00 


Table  34:  Relative  efficiency  of  the  two-stage  procedure  for  selecting  the  largest 
normal  population:  Equally  spaced  configuration. 
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